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TARGET SPECIFIC SCREENS AND THEIR USE FOR 
DISCOVERING SMALL ORGANIC MOLE CULAR PHARMACOPHORES 

This is a continuation-in-part of pending United 
States application 08/286,084 filed August 3, 1994 and 
which is incorporated in its entirety herein. 

FIELD OF THE INVENTION 



10 



15 



20 



25 



30 



35 



The invention relates to a general process by which 
recombinantly derived antibodies (rVab) are engineered and 
selected to identify unique active surfaces of 
pharmaceutical targets . These recombinant antibodies are 
useful as reagents to identify natural or synthetic 
entities which occupy active surfaces of pharmaceutical 
targets and which therefore may be useful as therapeutics. 
This invention also relates to elucidating the three 
dimensional conformations of the various rVabs which bind 
to the pharmaceutical target and confers target regulation 
and the use of high resolution molecular models to 
identify or synthesize biologically active small organic 
molecules useful as viable discovery drug leads. 

BACKGROUND OF THE INVENTION 

Today there are many approaches to identifying 
chemical entities which have a desired effect on a 
pharmaceutical target and therefore potential as drugs. 
Common to all of these processes is the sequential use of 
multiple assays to identify a test compound's composite 
activity profile. This activity profile usually consists 
of information on four basic attributes: potency, 
activity, selectivity and specificity. Selectivity 
indicates the ability to distinguish among closely related 
members of a particular target family. Specificity is the 
ability to distinguish between unrelated targets. Only 
two types of assays are used to develop the activity 
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profile of a potential drug: one, a binding assay to 



measure affinity (i.e. potency) of the compound; and a 
second, an activity assay, to measure the compounds effect 
(i.e. agonistic or antagonistic) on the target. Binding 
assays measure the formation of the complex between target 
^ (T)and ligand (L) . Targets include receptors, enzymes or 
structural components. Ligands include signals such as 
hormones, neurotransmitters, growth factors or test 
compounds. Until recently, L was labelled in some fashion 
(L*) for identification and quantitation of the L:T 

*® complex. Recently, binding assays have been developed 
which use a tagged R (R*) to assess L affinity (see 
below) . All these processes of labelling and R:L complex 
isolation and quantitation are known to those skilled in 
the art and have been reviewed. 

^ In the process of searching for small organic 

molecules with appropriate potency, activity, selectivity 
and specificity for a particular pharmaceutical target, 
the order of testing is most often affinity, activity, 
selectivity and then specificity. In addition, some form 

2® of binding and/or activity assay, is interspersed with 
synthetic chemistry efforts at improving the compounds 
attributes. This generates an iterative cyclic discovery 
processes in which various assays and synthesis are 
repeated over until a compound possessing all of the 

^ desirable properties is obtained. . 

The present iterative process, although successful, 
is extremely time cbnsuming and has a high probability of 
failure for several reasons. Although binding and 
activity assays have now been automated, screening takes 

30 significant time as it is done on individual entities 

within chemical files containing over 100,000 entities. 
In addition, the properties of potency, activity, 
specificity and selectivity are separable, such that the 
presence in a compound of any one property is not 

35 predictive of attaining another. For example, binding 
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assays give no conclusive data on the activity (i.e., a 
compound with high affinity may be an antagonist), and 
activity assays do not predict selectivity or affinity. 
As a result, modifying a compound so as to change one of 
its attributes (i.e., agonist activity) without modifying 
another (i.e., target affinity or selectivity) is 
unpredictable and considerable time is added to the 
discovery program when high affinity compounds identified 
early in the discovery process turn out to have 
inappropriate activity or selectivity. 

The relatively large number of biologically active 
small organic ligands having different general structures 
and which are capable of binding to a particular 
pharmaceutical target suggest that the binding surface of 
the target is not singularly unique. Furthermore, binding 
assays using an endogenous ligand or close analog thereof 
are inherently biased to compounds which bind to only a 
fraction of the available surface of the target- Even 
where the labelled ligand is not an endogenous one, this 
confinement means that the vast majority of active 
compounds identified by this process will be greatly 
restricted to the surface domain of the target which is 
used for interaction with the endogenous ligand. 

This limitation is often viewed as desirable because 
the recognition domain for the endogenous ligand are those 
known via previous studies to have the ability to modify 
target activity. However, investigation of only one 
target area severely restricts the ability to identify 
useful ligands. As endogenous ligands in most instances 
are agonists peptides as in the case of opiate receptors, 
30 antagonist discovery can become a rare event. In 

addition, because endogenous binding domains often exhibit 
limited diversity among receptor members of a single 
target family, it becomes difficult for active compounds 
to discriminate among target family members. This often 
35 occurs when the endogenous signal for the family is a 
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single entity and not a group of closely related entities. 
Acetylcholine (ACh) receptors are an example of a target 
family with only one signal entity. The catecholamine 
receptors are an example of a target family with a few but 
highly related endogenous catechol signals. 

In many cases, target diversity is found in target 
domains other than the specific binding site of the 
endogenous ligand. Some of these domains may be 
associated with the target's other functions, i.e., signal 
transmission while others are quiescent domains not being 
used by any endogenous signals recognition or 
transmission. An example of a dilemma in discriminating 
among target family members is that found for the 
muscarinic receptor family (AChRm) where the binding 
domain for acetylcholine is used to monitor a test 
compound's potency, yet finding AChRm agonists which 
distinguish among the five ACHRm subtypes has proven 
illusive to date. 

The task for drug discovery is to devise a screening 
approach which provides detectable ligands to be used to 
screen compounds which bind to the target and provide 
information regarding potency, activity, specificity and 
selectivity, as well as the three dimension (3D) 
conformation of compounds active at that particular site 
on the pharmaceutical target. 

As part of any solution of these problems it is also 
necessary to establish binding assays which report the 
interaction of test compounds with allosteric modulatory 
sites on targets. An allosteric site is one which 
modifies the endogenous ligand binding site yet is 
discontinuous and non- overlapping with that site. Such 
target sites have important physiological and 
pharmaceutical consequences and have been reported. For 
example, the allosteric site on the Gaba A receptor binds 
benzodiazepines (BDZ) and thereby modulates the binding of 
the endogenous neurotransmitter Gaba. Occupation of the 
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allosteric BDZ site, which can be done by chemicals from 
many unrelated structural groups, has a significant and 
recognized therapeutic influence on physiological 
processes including anxiety and sedation. 

It is also known^ that active allosteric sites exist 
which are modulatory for endogenous ligand binding and 
have observable effects of their own on the target. Such 
an allosteric site is present on the Gaba receptor. 
[Garrett, Blume and Abel 19 86; Garrett, Abel and Blume 
1986] . 

Present screening techniques which monitor direct 
binding of test compounds to allosteric target sites are 
not routinely done because a) high affinity tagged ligands 
which bind to these sites are usually unavailable at the 
start of a discovery program; and b) the necessary 
monitoring of detectable endogenous ligand dissociation or 
bioassays are too time consuming in initial screening 
protocols. Without a simple, rapid and comprehensive way 
to observe all potential target sites, investigation of 
the surface of a pharmaceutical target for potential 
20 modulation remains limited to a small part of the target 
surface. New methods are necessary to survey the entire 
target surface in early screening for discovery leads. 

Recently methods of identifying various entities 
which recognize target surfaces have been reported which 
25 do not depend upon the availability of tagged ligands with 
high affinity for the target. [Delvin, J. J. , Panganiban, 
L.C., and Devlin, P.E., 1990]. These assays detect a 
compounds surface recognition activity directly via 
formation of an identifiable tagged target (T*) : Ligand 
30 complex. In one version, test compound is coupled in 
identifiable compartments to a solid matrix of varied 
composition at concentrations which allow sufficient 
amounts of labelled target to bind and form stable ligand - 
labelled target complexes for subsequent detection via 
35 chemical, radioactive, or biological methods known to 
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those skilled in the art. Subsequent isolation (or 
identification) of test compound from the compartments 
containing labelled target provide active chemical 
structures. In one such version where test compounds are 
free oligonucleotides, the oligonucleotides are isolated 
in complexes with the target, and are amplified and 
sequenced by PCR technology. [Delvin, J.J., Panganiban, 
L.C., and Devlin, P.E., 1990]. 

Phage display is a particularly sensitive method of 
presenting peptide test compounds to a target. Phage may 
be engineered to express the gene encoding the test 
peptide as a fusion protein with one of its surface 
proteins „ Methods involving phage display are referred to 
in Winter et al. PCT application WO 92/20791; Huse, 
WO92/06204; and Ladner et al . WO90/02809. 

Although these newer approaches have now been 
incorporated into random drug screening protocols, they do 
not resolve the following problems: the assays of the 
critical attributes of potency, activity, selectivity and 
specificity are still unconnected; active target surfaces 
including the endogenous ligand site and allosteric sites 
have not been identified; and 3D information on 
conformation of the active agent is not provided. More 
importantly, most of the agents available for screening, 
i.e., peptides, nucleotides, lipids, and carbohydrates 
which are available in large libraries, are not totally 
satisfying as discovery leads because none are expected to 
be orally active, or pass membrane barriers to get at 
intracellular or central nervous system targets. In 
addition, these classes of compounds are so flexible as to 
obscure their active 3D- configuration to such a degree as 
to prevent or severely limit their use as models for 
organic synthetic efforts. An improvement in screening 
would then encompass a resolution of these deficiencies so 
that these .broad surface recognition libraries could 
35 attain their full usefulness. 
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In covering the prior art for high throughput binding 
screens for target modifiers, it also is necessary to 
review what is known of the endogenous ligand signals as 
well as their targets. Both shed significant light on 
additional problems and limitations encountered in the 
binding assays available today for discovery approaches. 

Endogenous ligand signals are those ligands which 
directly modify target activity. The size of endogenous 
ligands varies greatly, ranging from 100 Dal tons (e.g., as 
for glycine in its regulatory role as an excitatory amino 
acid neurotransmitter) to over lOOkD (e.g., as for some 
extracellularly active growth factors (GF) with a 
proportioned increase on surface area. The composition of 
endogenous ligand is equally varied including organics 
such as neurotransmitters; peptides e.g. somatostatin, LH, 
LHRH and TRH; proteins eg., growth factors; and lipids; 
carbohydrates; and inorganics such as ions. 

For discovery purposes, common to all is the desire 
to replace the endogenous ligand with a small organic 
molecule. The problem of screening for replacements 
appears to be very different for most small endogenous 
ligands, i.e, neurotransmitters and neuropeptide 
modulators compared to large endogenous ligands i.e., 
hormones, growth and differentiation factors. Although 
small organic molecules have been found which can be 
active at targets for small endogenous ligands, few, if 
any have been found for the larger molecules such as 
proteins . 

Corresponding to the diversity in endogenous ligands 
is the equally extensive diversity in target domains which 
are responsible for recognizing (i.e., binding) and 
responding to endogenous ligand signals. It is generally 
accepted that both signal and target have specific domains 
involved in forming the actual contact points found within 
the endogenous ligand: target complex (EnI*:T) . .Recent data 
on crystallized growth hormone (GH) and its receptor 
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complex provides detailed molecular information on the 
amino acids within the GH hormone ligand and its target GH 
receptor interactive domains. 

Recent data on the crystal structure of GH and its 
receptor has shown a single GH molecule to contact the 
same set of amino acids in each of two identical GH 
receptor units complexed with one GH molecule. 
[Cunningham and Wells 1989; Cunningham et al . 1991; DeVos, 
et al. 1992] . Bach of the receptor units therefore has 
only one target site which is the same on both units. 
Bach receptor uses the same 7 amino acids to define the 
binding site which participate in GH binding and receptor 
dimerization necessary for activity. [Cunningham and 
Wells 1989; Cunningham et al . 1991; DeVos, et al. 1992]. 

Dimerization of at least two receptor subunits by 
monomeric or multimeric hormones is required for receptor 
activation for the majority of hormones studied to date, 
such as growth factors, including nerve growth factor 
(NGF) , epidermal growth factor (EGF) , fibroblast growth 
factor (FGF) , interleukins (IL2, 4 and 6) , interferons and 
insulin. [DePronzo, Bonadonna, and Ferrannini, 1992; 
Baniborough , Hedgecock and Richards 1994; Kishimoto, et al. 
1994; Claesson- Welsh, 1995] . In some cases, the two units 
of the hormone, as well as receptor are not genetically 
related. In such cases one subunit provides high affinity 
25 hormone binding and the other intracellular signalling 

(e.g., tyrosine kinase activity). [Ullrich, et al . , 1986; 
Kaplan, Martin-Zanco, and Patrada, 1991; Kaplan, et al . 
1991; Klein, et al . 1991; Argetsinger, et al . 1993; 
Obermeier, et al . 1993; Weiss 1993]. In some cases, the 
lower affinity receptor when dimerized can be activated. 
[Ullrich and Schlessinger 1990; Stahl and Yancopoulos 
1993; Claesson-Welsh 1995]. 

Among many hormones and hormone receptors, it is now 
apparent that an unexpected and unanticipated degree of 
structural homology exists with subgroups of these signals 
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and receptors forming homologous families which sometimes 
follow along different genetic evolutionary lines. Other 
functional similarities may be brought about as a result 
of convergent evolution. In either case, the active 3D 
conformations of ligands and receptors appear to follow 
some general principals. However, for drug discovery, the 
principals gleaned from these studies have not yet been 
detailed enough to bypass crystallography of particular 
hormone/ receptor complexes in order to gain sufficient 
specific information as to deduce the molecular shape of 
active small organic molecules. 

Deciphering the elements necessary in a signal to 
activate a hormone /growth factor receptor has included (1) 
crystal formation and analysis at <3A of receptor and 
endogenous ligand complexes; (2) the influence on function 
15 (i.e, ligand binding and receptor activation) caused by 

molecular biological mutagenesis of single amino acids or 
short peptide deletion/replacement, or chimera formation 
of both the hormone and receptor units. In addition, 
monoclonal antibody binding to surf ace domains available 
when ligand and receptor are either uncomplexed or in the 
R:L coiqplex, along with the ability of Fab2 versus Fabl 
to activate or block receptor activation in vjtyg, in situ 
or is vivo has been studied. 

The above studies when taken together, provide 
25 information concerning (1) the contact points between 

hormone and receptor; (2) the amount of energy of binding 
involved in these contact points; (3) amino acids outside 
of the receptor: ligand contact points essential for global 
receptor/ligand stability or dimer stability, or receptor 
30 signalling activity (i.e. tyrosine kinase, binding of 

other intracellular regulatory factors, internalization, 
uncoupling for effector system) . 

Critical for identifying small organic molecules 
which are active at hormone receptors are the data from 
35 the above indicating (1) number of units/active complex; 
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(2) amino acids of the target specifically involved in the 
binding domain with the endogenous ligand; and 3) amino 
acids of the ligand specifically involved in binding 
and/or activating the target. Of all of the above 
information, clearly the rate limiting event today is 
obtaining sufficiently resolved crystallography data of 
hormone/receptor complexes. However, complexes of 
receptor and ligand are often difficult to identify and 
crystalize thus preventing one from obtaining the 
structural information. It is also recognized that the 
various molecular, biological, immunological studies, 
biochemical and pharmacological studies noted above, also 
take considerable time and effort. Accordingly, prior art 
approaches to identifying active small organic molecules 
are long and arduous with unpredictable results. 

In the approach outlined above, it is important that 
both structural and biological data be obtained as each 
has its own limitations and artifacts. Also, contact 
points could reflect specific aspects of crystal formation 
which do not reflect the structure <at the protein in situ , 
or the crystal may contain an inappropriate number of 
subunits. On the other hand, the biological data 
generates both false positives and negatives. 
Furthermore, if antibodies are used to probe the binding 
site of the target, not all receptor or ligand surfaces 
25 may be immunogenetic accessible to. Fab2 or Fabl antibody. 
Another problem is the difficulty studying allosteric 
sites which do not interact directly with the signal 
ligand. 

Despite considerable effort, a major problem in drug 
discovery has been the identification of small organic 
molecules capable of activating peptide hormone/growth 
factor receptors. This is likely the result of the 
multivalent nature of endogenous ligands for these 
receptors and the requirement to dimerize or 
35 simultaneously activate multiple attachment sites on a 



20 



30 




• 



WO 96AM557 



PCT/US95/10182 



- 11 - 



o 



single receptor (receptor subunits) for receptor 
activation. Even for receptors which are homodimers, such 
as GH receptor (GHR) , a single small organic molecular 
monovalent attachment to the GHR site I is not sufficient 
to cause activation, nor displacement of growth hormone 
^ from its active divalent dimer receptor complex. 

Failure to find single small organic molecules in 
conventional binding assays steins from the fact that the 
labelled hormone is bivalent, and its displacement from 
two receptor units by a single monovalent small organic 

10 molecule (i.e. compounds which attach to only one receptor 
target at a time) is thermodynamically unfavorable in the 
present day binding assay. Furthermore, in the large 
majority of cases the receptor for a given hormone is a 
heterodimer. Thus, for a given hormone /growth factor - 

^ receptor binding pair, there may exist at least two 

different binding sites on the target which may be due to 
the multimeric nature of the target or a target consisting 
of allosteric sites on a monomeric unit. In all of these 
cases, the endogenous ligand mast therefore comprise at 

20 least a sufficient number of binding sites which are 
properly spaced to bind to the multiple sites on the 
target necessary for activation. Obviously, one would 
then require a multimeric or a multivalent small organic 
molecule for displacement of these hormones from their 

25 targets. 

Given the complexity required of each small organic 
molecule to bind the receptor at the multiple sites 
necessary for activity, or to displace the endogenous 
ligand, one could expect that the occurrence of a single 

30 small organic molecule with two unrelated yet active 

binding domains would be equal to the chance of finding 
one multiplied by the chance of finding the other 
independently. As active small organic molecules are 
found by random robotic assays at a frequency of between 

35 1/1000 to 1/10,000 on most screens for ligands requiring 
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only one binding site on the ligand, and which have 
correspondingly a single binding site on the receptor, one 
would expect to screen an organic chemical libraries 
containing from 10 6 to 10 1 compounds in order to identify 
an active molecule. Such libraries exceed those which 
could be screened in some reasonable assay format and 
actually exceed most made by even the largest 
pharmaceutical companies . 

Therefore, a different approach to screening for 
small organic molecules which can activate hormone 
receptors is needed. 

A number of libraries now exist for screening such 
large numbers. Two have been noted already, the 
oligonucleotide and peptide library. Another such file 
contains natural products. 

Classical chemical libraries consisting of 
synthetically derived small organic molecules are 
routinely available from commercial sources (e.g. 
Alldrich, and Kodak) and consist of upwards of a 1-200,000 
entities. Recently a survey of the chemical entities 
within such libraries uncovered 100,000 or so chemical 
structures as being the cores upon which most of the 
individual entities were crafted. The average molecular 
weight of the entity within such files ranges between 
200-400 Dal tons which would account for no more than one 
such contact site per target . 

Screening of small chemical compound libraries is 
limited only by their availability, which most often is 
<100,000. 

With the advent of molecular biology and gene cloning 
and sequencing, it has been discovered that most 
pharmaceutical targets are not unique entities unto 
themselves, but in fact belong to families of sometimes 
rather large size and close relatedness . Recognition of 
this fact has mandated a much more serious look at all of 
the members of the family to which the target under 
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investigation belongs so as to identify lead compounds 
which can distinguish among its family members. If one 
used only binding assays as a primary screen for potency, 
activity, selectivity and specificity, one would require 
affinity labelled standards for each of the family 
5 members. Although this is potentially possible when the 
endogenous ligand signal are proteins due to their native 
affinity and ease of labelling, it is not presently 
feasible where small organics are the only known signals . 
This approach is also unsuitable for targets with 
10 unidentified signal ligands. Any discovery of how to 

include such widespread specificity testing into primary 
binding screen assays would greatly increase the 
probability of drug discovery success. 

15 SUMMARY OF THE INVENTION 

This invention provides compositions and methods for 
identifying active surfaces of biologically active sites 
of pharmaceutical targets. Identification of these sites 

2® is useful for preparing reagents suitable for use in 

screening assays of small organic molecules to identify 
those as. candidate lead compounds possessing desired 
attributes of biological activity, specificity, 
selectivity and affinity. 

25 Reagents are provided by this invention which are 

suitable for identifying active sites on pharmaceutical 
targets. The reagents comprise libraries of variable 
regions of antibodies obtained and modified by molecular 
biology techniques which are used" to prepare recombinant 

30 pab fragments (rVab) useful for scanning the surface of a 
target in a manner so as to identify those rVab's having 
desired potency, activity, specificity and selectivity. 
The attributes of potency, activity, specificity and 
selectivity are collectively referred to as a "composite 

35 activity profile" (CAP). The rVab's which are made and 
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identified by this invention as possessing the desired CAP 
attributes specifically bind the target (i.e. are T+) , are 
selective for the target (S + ) and activate the target or 
are capable of activating the target when combined with 
another ligand (A + ) . 

By combining structural features of various members 
of the recombinant antibody library which possess activity 
at a defined pharmaceutical target, this invention 
provides a method of determining a composite structure 
possessing the desired composite activity profile. This 
composite structure may then be used to identify small 
organic molecules capable of acting at the target surface 
with either agonist or antagonist activity with the 
sufficient specificity and electivity. 

The method according to this invention of identifying 
ligands capable of binding to active sites and possessing 
a composite activity profile for a given pharmaceutical 
target comprises combining members of a recombinant 
antibody library with a pharmaceutical target coupled to a 
reporter which reporter is capable of signaling activation 
or inhibition of the pharmaceutical target. Reporters of 
pharmaceutical activity may include but are not limited 
to, for example, receptor coupling to modulators such as 
the G protein; oligomerization of receptor subunits; 
changes in enzymatic activity such as kinase activity; or 
changes in ion flux. According to. this method, individual 
members of the library possessing -desired activity as 
demonstrated by the reporter, are useful individually or 
collectively in subsequent assays to identify small 
organic molecules capable of possessing the desired 
activity at the pharmaceutical target. By combining 
structural features in common between multiple members of 
the library possessing the desired activity, a composite 
structure for activity may be derived which may then be 
used to create a model for a compound possessing the 
desired activity attributes. 
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This invention also provides a method of identifying 
small organic molecules which are active at the target 
sites comprising screening potential drug candidates in a 
binding assay for their ability to displace labelled, rVab 
members possessing a desired composite activity profile 
consisting of potency activity, selectivity and 
specificity for the pharmaceutical target. 

Small organic molecules as candidates for drugs may 
also be identified by analyzing the structure of the model 
derived from the structure of at least two active members 
of the rVab library and determining common characteristics 
including, but not limited to charge and spacial 
orientations which participate in binding to the active 
sites of the pharmaceutical target. Using the model, 
small organic molecules may be obtained by synthesizing 
compounds possessing the common structural features 
identified in the model, or screening a chemical file data 
base for members possessing features in common with the 
model . 

This invention also provides means of identifying 
20 structural requirements of ligands capable of binding to 
pharmacological targets comprising multiple binding sites 
existing on one or more molecular entities which when 
bound by a single ligand are capable of activating the 
pharmacological target. Similarly, this invention 
25 provides a means of identifying structural requirements of 
multivalent ligands capable of activating pharmacological 
targets comprising binding sites too large to be occupied 
by a monovalent small organic molecule or requiring 
concurrent binding of a multivalent ligand to effect 
30 oligomerization of separate molecular entities to form an 
active pharmacological target. 

This invention also provides reagents comprising 
recombinant antibody libraries (rVab's) which have been 
constructed to encode CSR and CDR regions with specific 

35 
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variations and in which the CDR and CSR regions are 
expressed on a specific identifiable framework structures. 

The recombinant libraries of the invention may be 
packaged in various forms including bacterial phage which 
express the recombinant antibodies on their surface. 

It is therefore an object of the present invention to 
provide a process for the identification of small organic 
molecular replacements capable of modifying a 
pharmaceutical target with a desired composite activity 
profile comprising sufficient potency, activity, 
specificity and selectivity to be considered as an initial 
drug discovery lead. 

It is a particular object of this process to identify 
surfaces of a pharmaceutical targets capable of 
discriminating among members of a family of related 
targets which are activated by the same or similar 
endogenous ligand or utilize similar signal transduction 
mechanisms . 

It is a particular object of this process to identify 
active or regulatory surfaces of a pharmaceutical target 
which may or may not be used by an endogenous ligand for 
the target of interest, and which is nevertheless capable 
of modifying the pharmaceutical target in some 
pharmaceutically useful manner. 

It is a particular object of this process to identify 
allosteric sites on the pharmaceutical target which are 
not used by endogenous signals nor have activity on their 
own, as well as active allosteric sites which are used by 
endogenous signals other than the pharmaceutical target 
activating signal and which have some type of activity on 
their own. 

It is a particular object of this process to provide 
a repertoire of surface recognition libraries which 
together recognize diverse pharmaceutical target surfaces 
by constructing a small number of combinatorial antibody 
35 libraries. 
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It is a particular object of this process to convert 
by a single simple and rapid process any unlabelled 
recombinant variable antibody fragment (rVab) isolated 
from a library to a labelled one to act as a reagent 
capable of identifying small organic molecules which 
possess any one, or combination thereof, of the attributes 
of potency, activity, specificity or selectivity 
simultaneously when screening random chemical libraries. 

It is an object of this process to identify the 
specific binding regions of pharmaceutical targets 
requiring binding to sites in at least two different 
regions to cause a response of the target. Such regions 
may be present on monomeric or oligomeric pharmaceutical 
targets. The endogenous ligands for such sites generally 
are multivalent monomeric or oligomeric proteins which 
bind to the multiple regions which define the active 
surface of the pharmaceutical target. 

This invention provides a method for identifying the 
structural requirements for ligands to bind at the 
separate regions and identifying such ligands. By 
combining the ligands capable of individually binding to 
the separate regions into a single molecule, fully active 
ligands are provided. 

It is another object of this invention to identify 
the monovalent determinants making up the active surfaces 
^ on the targets for large protein signals such as hormones 
and growth and differentiation factors consisting of 
oligomeric receptors. Such receptors may contain 
homologous or heterologous components with one or more of 
these units containing a part of the signal recognition 
determinant . 

It is a particular object of this process to use 
chemical oligomerization of small organic molecules for 
each of multiple binding sites to derive an active 
oligomer for large proteins such as growth factors and 
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hormones which contain multiple binding sites within their 
active binding domains. 

Accordingly, another object of this invention is to 
identify small organic molecule replacements for large 
protein signals such as growth factors and protein 
hormones be they allosteric or competitive modifiers and 
whether they be monovalent or multivalent. 

It is a particular object of this invention to 
identify small organic molecule replacements for 
pharmaceutical targets which have no bioorganic endogenous 
ligand signals, such as certain ion channels, pumps, and 
exchangers . 

It is a particular object of this invention to 
provide high volume binding assays which discriminate 
agonist from antagonist small organic molecule 
replacements . 

It is a particular object of this invention to be 
able to identify from large antibody variable region 
libraries, individual variable regions which distinguish 
from one another binding sites which confer selectivity of 
pharmaceutical targets for specific members of a gene 
family. 

It is a particular object of this process to provide 
labelled antibody variable regions which interact with and 
modify the activity of targets which have no identified 
endogenous ligand, nor exogenous natural signals, and 
which labelled ligands have sufficient affinity for the 
pharmaceutical target to be used in competing binding 
assays in which small organic molecules may compete for 
binding with the labelled ligands. 

It is another object of this invention to provide a 
plurality of different recombinant antibody variable 
regions which recognize at least one common binding site 
of- a pharmaceutical target and which collectively provide 
structural information useful for designing active small 
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organic molecules which are active at the pharmaceutical 
target . 

It is another object of this process to provide a 
general method to rapidly obtain peptide structures which 
are useful as 3D models comprising the minimum 
5 characteristics of small organic molecule replacements 

which have sufficient potency, activity, selectivity and 
specificity to classify as viable discovery leads. 

It is a particular object of this process to provide 
molecular models for active ligands wherein the 
10 pharmaceutical target necessary to be occupied by active 
ligand comprises one or more binding sites on one or more 
molecular entities. 

It is a particular object of this process to be able 
to solve the canonical structures of the CDR VH3 of 
15 recombinant antibodies which have been identified as 

possessing the desired properties of potency, activity, 
selectivity and specificity. 

It is a particular object of this process to be able 
to use composite structural characteristics to direct a 
20 synthetic effort capable of directly synthesizing active 
small organic molecules. 

RPTEF DESr PTPTION OF THE FIGURES. 

25 FIG.l. Stages of the Topographic System Assay <TSA) . 

Fig. 1 shows the activities and products of the three main 
stages of the TSA. When combined together. Stage I and II, 
or Stage I and III, allow the identification of small 
organic molecules (SOMERS) which are active at 

3° pharmacological targets (T) . A MULTIMER is at least two 
SOMERs covalently linked together to produce an active 
molecule. A BEEP is a biologically enhanced ensembled 
pharmacophore, and Tn* is subunit n of pharmacological 
target . 
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FIG. 2. Related Antibody Structures and Variable 
Region Domains. A. Shows various forms of antibody 
structures including the variable (V) and constant (C) 
regions of immunoglobulin (Ig) heavy (H) and light (L) 
chains. Antibodies constructed in this invention by 
molecular biology technology have a r prefix. B. Shows 
details of the antigen recognition Variable region (V) 
domains of the VL and VH. FW is the 'constant' framework 
regions; "CDR" refers to the complementary determining 
regions as defined by Kabat (Rabat 1991) ; CSR refers to 
canonical structures found in CDRs as originally defined 
by Chothia (Chothia and Lesk, 19 87) ; V (with leader 
sequence), D (diversity) and J(V/C junction) are the genes 
which are combined to create the mature VH and VL genes. V 
Regions are attached via genetic recombination for VL to 
either a kappa or lambda Constant region. VH are 
recombined with three Constant regions in sequence with 
CHI being attached to VH. The V regions of the invention 
can used either without C regions, or with kappa or lambda 
for CL, and up to three C regions for CH. 

FIG. 3. Potential Planar, Cavity and Grove Antibodies 
of Known Crystalline Structure for rVab Library 
Cons truct ion. Fig. 3 lists a number of antibodies for which 
there data is known concerning their crystalline structure 
and which are potential parental antibody structures for 
construction of the rVab library as described in this 
invention. The antibodies are grouped according to their 
type of antibody combining site : i.e., planar, grove or 
ca vi t y - type structure. 

FIG. 4. Comparison of Natural Fab and rVab Library 
Diversification. A Nature's Immune Repertoire: V, D and J 
are the genes recombined to make the mature V gene; rf* 
are the reading frames of the D gene which can be used to 
make sense protein sequences upon recombination with V and 
J. CDR* are there are no CSRs for the VH3 region. The 
number of known CSR for each CDR is given in parentheses. 



r 

WO96/04557 PCT/US95/101K 



- 21 - 

B. The rVab Repertoire: Diversification arises by using 
all permutations of the known CSRs, 3 different length 
CDRH3 and randomization of amino acids at two positions 
within each CSR (or GDRH3) within a single VH and VL 
parental framework structure. Primary randomizations are 
5 made during construction of the rVH and rVL (see Figs. 

7,8) and allows all 20 essential amino acids to appear at 
given positions within V regions among members of the rVH 
and rVL libraries. CDRH3 are three known CRDH3s of 
different sequence and three different lengths covering VH 

10 amino acid positions 95-102 (see text for details). rVab 
is encoded by one rVHCHl and one rVLCL gene on the same 
piece of DNA. Totals of CSR include CSR and CDRH3 
combinations . 

FIG. 5. Type and Diversification of Amino Acids at 

15 various positions within V region. Numbering of the amino 
acid (AA) positions as per Rabat (Rabat . 1991) . Library 
Diversification identifies the high priority candidate 
amino acid position for primary library diversification 
during construction of rVH.lib and rVL.lib as described in 

^ this invention. 

FIG. 6. CDR and Canonical Structures (CSR) of V 

Regions. 

Particular amino acid (single letter code) at V gene 
positions critical for particular CSRs are given as 

25 defined by Chothia (Chothia and Lesk, 1987) . * represents 
amino acids not within CSR or CDR which participate in 
defining the CSR. The diversity position is the amino acid 
position used for primary library randomization as 
described in this invention. 

30 FIG. 7. Construction of the rVLCL Lib. of Diversified 

Canonical CSRs: rVLCL.Lib. A-F are sequential steps of the 
process constructing rVL.lib. G is the final step of 
recombination of rVL.Lib with a rCL to form rVLCL.Lib. 
Amino acid positions occur in brackets; nucleotide 

35 positions are given left to right as 5' -3' in 
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parenthesis; restrict ion sites (rs) also appear in 
brackets and have a n p" prefix to denote when they are 
located within the plasmid and not the V region. 
Restriction sites are denoted by combinations of letters 
and numbers. Primer direction is denoted by arrows (left 
is forward (FWD) ) , and right is a reverse (BCK) primer) . 
* denotes more than one amino acid at a CSR position which 
is critical for a particular CSR; A denotes that 
diversification by randomization of amino acids with CSR 
or CDR has occurred. Lib suffixes indicate a library of 
many individual members. Heavy line indicates that the 
product (single entity or library) has been cloned in to 
plasmid pCLONALL (pC) . 

FIG. 8. Construction of the CSR and CDRH3 
Diversified rVHCHl.Lib. Construction of CSRHl and H2 and 
three CDRH3 of different lengths (i.e., 5. 7. or 10 amino 
acid insertions) ; diversification by amino acid 
randomization and combination of CSR and CDRH3 in all 
possible permutations is as illustrated in a manner 
analogous to that described for rVLCL.Lib (see legend to 



FIG. 9. General Usage Plasmids. A. Illustrates the 
sequence of restriction sites (rs) which occurs in the 
cloning site of pCLONALL . Use of each in rVH.Lib and 

rVL.ljib construction is noted wherein " ■ denotes a 

restriction site used and defined by parental AB sequence; 
wherein X denotes a restriction site not used in that 
particular rV.lib construction. General positions of 
restriction sites within the rV and rC regions under 
construction are shown. JCH and J*TCL are the natural J/C 
gene recombination region with included amino acid 
positions given in brackets. JCLINK is the position of the 
J/C recombination restriction site, also referred to as 
rs3 . B. Events used in constructing the plasmids carrying 
rC regions and in the final step of rVab.Lib construction 
wherein rV regions are appended to rC regions. The two 
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plasmids needed for this are listed as pVxACCEPTORs . C. 
Plasmids used in creating expression vectors for the 
rVHCHl and rVLCL chains of the rVab when not attached to 
phage coat protein gpIH- EK is the enterokinase cleavage 
site. ISOTAG is the additional amino acid sequence useful 
in isolation and labelling rVab as rVab - REPORTER 
constructs . 

FIG. 10. General Primer Table. Primers are written as 
5' -3'. Numbers and single letters designate individual 
amino acid positions which in the primers would be 
corresponding triplet codon sequences for the amino acid 
at these positions. The letter N within parenthesis 
denotes the random appearance of the nucleotides A,T,U,C 
used to randomize the amino acid at this position. 
Letters, without parenthesis, are used for sequences 
necessary for a desired CSR or CDRH3 structure; numbers 
are used for sequences which are not critical to CSR or 
CDRH3 structure. rs is a restriction site sequence. 
Sequences for all FWD primers are complementary to the 
sense sequence. Approximate primer sizes in nucleotides 
are listed as #mer. The right hand column signifies 
general use of primer with amino acid randomization; and 
SEQ. is sequencing. 

FIG. 11. Constructs for CRE-LOX re combinatorial 
formation of rVab.lib: Parti. Expression of rVab with or 
without one attached random octamer peptide (Pep 8) 
library. Figure 11 illustrates the steps generating the 
necessary phagmid and plasmid constructs to allow in vivo 
recombination of individual rVHCHl.lib and rVLCL.lib 
members, by the-Cre recombinase, and the construction of a 
single phagmid containing an rVHCHl and rVLCL member on 
one piece of DNA (i.e., an rVab> . This procedure is used 
for rVab.Lib construction where there is no need in the 
TSA discovery process for subsequent addition to rVab of 
more than one random octamer peptide (Pep8.Lib). Wild type 
(wt) and mutant (511) loxP sites are as defined in legend 
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to Fig . 12. LpelB and LgpIII are leader sequences for pelB 
and gpIII. 

FIG. 12. CRE-LOX Plasmid and Phagmid Sequences used 
for rVab.lib Construction. For use in rVab.lib 
construction by in vivo Cre- recombinase directed 
recombination of rVHCHl and rVLCL onto single phagmids 
where there is a subsequent need in the TSA process for 
attachment to rVab of no more than one random peptide 
library. 

FIG. 13. Constructs for CRE-LOX Re combinatorial 
Formation of rVab.Lib: Part II. Expression of rVab with 
or without one or two attached random octamer (Pep 8) 
peptide libraries. Steps involved in adding PepS.Lib; i) 
illustrates expressing one peptide (PEP ,M ) at the amino 
terminus of VH (Pap8 ,M ) ; ii) illustrates expressing one 
peptide at the carboxyterminus of CL (Pep 1 *) ; and iii) 
illustrates expressing one peptide at the aminoterminus of 
VH <Pep8 ,M and one peptide at the carboxyterminus of VL 
(PepS 2 **) . Step E illustrates use of two primers required 
to append Pep8.Lib to either VH or CL. 

FIG. 14. In vivo. Generation and Expression of 
rVab.Lib members. The generation of rVL and rVH gene 
pairs (rVab) as one DNA molecule, as well as the 
expression and phage display of rVab attached to coat 
proteins of fd is illustrated. Synthesis of rVHCHl- and 
rVLCL proteins and their complexation to form gpIII 
attached rVab for phage display is illustrated showing 
cells, such as bacteria, infection of bacteria with phage 
carrying rVLiCL and transformation with DNA plasmids 
carrying the rVHCHl- construct; and in vivo recombination 
of rVHCHl and rVLCL onto a single fd via the LOX sequences 
and the PI provided CRE- recombinase. Following 
recombination and replication, a combined single 
expressible pair of rVab genes is packaged per phage. 
When induced, rVLCL is made and introduced via its leader 
into the periplasmic space were its complexes under 
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reduced conditions with synthesized rVHCHl-gpIII coat 
protein to create the desired rVab complex attached to the 
gpIII phage coat protein (see text for details) . 

FIG. 15 Flow Diagram of Diversification and 
Simplification Paths of the TSA. Steps are outlined for 
optimizing TSA+ attributes of rVabs for a given 
pharmacological target. The library attributes are potency 
of binding to Target (T) , specificity and selectivity for 
Target (S) and regulation of target Activity (A) . 
denotes that the attribute is present in the rVab member. 

FIG. 16. Isolation of Target (T+) Specific/ 
Selective <S+) rVab. A. Isolation by panning for Target 
recognition (binding) (T+) . B. Isolation by panning for 
Target Specificity and/or Selectivity (S + ) . Isolation of 
T+ and S+ rVab can be done in any order, and when used 
15 together isolate rVabTS + members. T denotes the 

pharmacological target; * phage displayed rVab; com-T-pep 
represents the entity, holotarget, subunit or peptide 
fragment, which is to be distinguished from the Target. 
Binding to the com-T-pep prevents rVabS' binding to matrix 
attached T. 

FIG. 17. Selection of rVab Scanners for Active Target 
Surfaces Used by Signals with Single Attachment Sites. Fig 
17 presents a flow diagram of the TSA process isolating 
rVabTSA+ members from a rVab library previously identified 
as T+S+. T, S and A are defined in legend to Fig. 15. 
Native signal is the endogenous or previously identified 
agonist entity (e.g., protein, peptide, neurotransmitter) 
which activates Target by interaction at a single 
attachment site (see Text for details) . Allosteric 
Effector is an endogenous or previously identified entity 
which binds to a single attachment site on Target which 
modifies agonist activity but has no activity on its own. 
rVabA+ are isolated by competition by native signal or 
allosteric change in T by allosteric effector which 
prevents normal rVaBTS+ binding to T. The binding of 
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rVabTS+A- members is unaffected by the presence of the 
allosteric effector or native signal and therefore is not 
isolated free in the supernatant during this process. 

FIG. 18. Discovery of SOMERs for a Target with a 
Single Univalent Active Site. Fig. 18 illustrates the steps 
of the TSA process of rVab- Scanner to Reporter conversion 
and Reporter use in competitive binding assays to identify 
active SOMERs for the pharmacological Target. Both 
competitive and allosteric active SOMERs are identified in 
this process. 

FIG. 19. Identification and Isolation of Active 
rVabTSA* for the Muscarinic Acetylcholine Receptor subtype 
ml (AChR fnl ) . (s) -denotes matrix attached Wheat Germ 
Agglutinin; T,S and A are as defined in the legend to 
Fig. 15; R denotes receptor target; G denotes guanine 
nucleotide binding protein; RG denotes RG noncovalent 
complex; 0 denotes phage displaying rVab. The TSA process 
isolating rVab based on specificity/selectivity (see Fig. 
16) is illustrated for the isolation of AChR^ rVabS mI + 
using Agonist -Like rVabT+A+ ( type l rVabTA+) . The same 
process is used for isolating Partial -Agonist -Like, 
Allosteric-Agonist-Like and Competitive Antagonist -Like S+ 
rVabTA+ (i.e., respectively type 2,3 and 4 rVabTA+) 

FIG. 20. Isolation of Active rVabTSA* for Complex 
Active Sites on Dimeric Receptor Targets ?T,. 2 ) . Fig. 20 
illustrates the TSA process by which the rVab pair for 
each part of the active site on each of two receptor 
target subunits (Tl or T2) is isolated. The process is 
shown in full for one member of the pair; that for the 
active site region on Tl, and is duplicated for the active 
site region on T2. m-T denotes matrix attached Target; 
comp-T- receptor denotes comp-T-pep as described in Fig. 16. 
0 denotes phage displayed. PepS.Lib is the random 
octapeptide library displayed as a fusion protein with 
phage coat protein gpIII. Pep8T 2 + is the library of 
peptides which bind to T2. rVabTl - Pep8T2 . Lib is the 
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rVabTlS+.Lib to which the Pep8T2+ Lib has been appended 
(see Fig. 12 and 13 for details of rVab-pep.Lib 
construction). Preselection of the T2 + Pep8.1ib is not 
required and a random Pep8 Lib can be used in this 
process. Testing for rVabS+ is optional and can be done at 
any step along the process. The related 

rVabT2m+S+A+-Pep8Tl+ member of the active domain pair is 
obtained in parallel analogous manner. 

FIG. 21. Using Active Bivalent rVabTl-Pep8T2 to Screen 
for Disomer Replacements of a Multivalent Signal. Fig. 21 
presents a flow diagram of the steps of the TSA in which 
each rVab member of the active pair of rVab-Pep8 for both 
domains of the active site, which occur on separate T 
subunits are used to find a DISOMER replacement for the 
native signal and which regulates Target activity. [A+] 
denotes that the rVab-Pep entity is active in regulating 
the Tl-2 dimeric Target. A* denotes that the rVabTS+ 
member is derived from a rVab-Pep entity which is [A+] . 
DISOMERmn denotes covalent linked SOMERs for the pair of 
active site domains identified by the paired rVabTSA* 
members . 

FIG. 22 Summary of the Discovery of DISOMERs for a 
Bivalent Hormone . 

FIG. 23. Flow Chart of TSA Steps Creating and Using a 
Biologically Enhanced Ensembled Pharmacophore (BEEP) . 
^ FIG. 24. The TSA Process of Finding and Relating Sets 

of Surface Attributes of rVabTSA* to Create a BEEP. 

FIG. 25. The TSA Process of Finding the Surface Common 
to All Active rVabTSA* Scanners for an Active Site of a 
Target . 
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This invention provides methods and compositions for 
identifying ligands capable of identifying active sites on 
pharmacological targets. This invention utilizes 
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recombinant antibodies which possess the combined 
attributes of potency (affinity) , selectivity, 
specificity, and activity as reagents useful for modelling 
active ligands and identifying small organic molecules 
which also possess these attributes and therefore utility 
as drug leads or therapeutic compounds. 

I. Pharmacological Targets Identified By This Invention, 

Pharmacological targets may be receptors for 
endogenous or other ligands which evoke a physiological 
response by the cells on which the receptors are present. 
Besides receptors, the pharmacological target may be ion 
channels, transport proteins, adhesion proteins such as N- 
CAM, or any other physiological regulatory surface which 
is excessible to being identified by the recombinant 
antibodies and which is activated by a specific ligand. A 
non- limiting list of exemplary physiological ligands for 
which active surfaces may be identified by using the 
methods and compositions of this invention are listed in 
Example 4 . 

Receptors may include those for neurotransmitters, 
hormones, growth or trophic factors, modulatory peptides, 
ions or other moieties which act as signal ligands for the 
pharmacological target. Preferred nonlimiting examples of 
neurotransmitter and peptide receptors for which active 
surfaces may be identified include those for 
acetylcholine, i.e., nicotinic, and the various forms of 
the muscarinic ml -5 receptor subtypes; adrenergic 
receptors including at,, a 7 , 0,, f} 7 , ; dopaminergic receptors 
including D |f D^, D a , D 3 and D 4 , and D 5 ; serotonin receptors 
including 5-HT,, 5-HT IA . D , 5-HT 2 , 5-HT 3 , and 5-HT 4 ; 
benzodiazepine receptors; opiod receptors including 6, *, 
and fi; and others. Also preferred are receptors for 
hormones and growth factors which may, for example, 
include those for insulin; growth hormone; erythropoetin; 
neurotrophic factors, including but not limited to nerve 
growth factor, ciliary neurotrophic factor, brain derived 
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neurotrophic factor, NT- 3 and NT-4. Receptors for 
cytokines such as interferons, and the interleukins are 
also preferred as are receptors for nonpeptide hormones 
such as thyroid hormone, and glucocorticoids • The methods 
and compositions of this invention described herein may be 
adapted by methods known in the art and applied generally 
to identifying the specific binding surfaces of other 
pharmacological targets as well. 

Other target surfaces for which active ligands may be 
identified include extracellular, intracellular, nuclear 
or mitochondrial located soluble or membrane associated 
proteins, carbohydrates, lipids nucleic acids or complexes 
thereof which play a role in a physiological or 
pathophysiological process involving a predictable 
indication for which one would like to have a drug based 
therapy. 

The pharmacological targets according to this 
invention, are physiological molecules, or combinations of 
molecules associated through covalent or non-covalent 
forces, which alone or in combination with other 
molecules, evoke a physiological or therapeutic response 
when activated by a ligand which binds the "active 
surface" of the pharmacological target. By "active 
surface" is meant the region of the pharmacological target 
which can bind a ligand, whether or not there are native 
^ endogenous ligands for these sites, and translate that 
binding into a physiological meaningful response 
characteristic of the target. Where the response requires 
oligomerization of at least two separate molecular 
entities by a ligand, binding to the active surface on 
30 only one of the molecular entities is insufficient to 
evoke the physiological response. 

The active surface is comprised of specific atoms or 
other chemical moieties which participate in the binding 
of the ligand to the pharmacological target, for example 
35 by contributing to changes in enthalpy or entropy. The 
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active surface of the pharmacological target may be small, 
capable of being bound by a single monovalent ligand 
having a molecular weight of less than about 1000 daltons; 
or large, requiring a multivalent ligand for binding to a 
plurality of binding sites which contribute to the active 
surface. Multiple binding sites may be present in a 
larger binding domain in a single region of the 
pharmacological target. Alternatively, multiple binding 
sites may be present as separate non- contiguous regions 
which may be bound by a ligand capable of spanning the 
pharmacological target to simultaneously bind the 
different binding sites of the target. In addition, 
binding sites may be present on two or more molecular 
entities, which may be the same or different, and which 
require oligomerization by binding to a multivalent 
ligand. 

Growth Factors (GF) , including NGF, BGF, FGF, 
interleukin (e.g. IL2, 4, 6) interferons, insulins and 
many other extracellular biosignals along with their 
respective receptor targets apparently contain multiple 
target binding sites. Such protein signals are in the 
order of 20-1000 K Daltons and exist as monomers or 
homo- or heterodimers or more complex mul timers, which 
encompass surface areas of tens of thousands of A 2 . 
Estimates of the surface area of such endogenous ligands 
and receptors which are occluded by their association 
ranges from 500-1600A 2 By the above definition, each 
ligand has >2 binding sites and each receptor has >2 
corresponding binding site which are discontinuous and 
non -overlapping with each other. 

II. Use of Recombinant Antibodies rVab's As Scanners 
To Identify Active Surfar^s __ 

This invention identifies and characterizes active 

surfaces by constructing and using a sufficiently large 

repertoire of diverse ligands capable of "scanning" the 

surface of pharmacological targets and binding to their 
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active surfaces. Confirmation of binding to active 
surfaces is accomplished according to this invention by 
monitoring a change in function of the pharmacological 
target or by monitoring a biochemical or biophysical 
change which reports binding and/or activation of the 
pharmacological target or receptor on the target. 

Antibodies have most of the above required attributes 
and can be recombinant ly engineered so as to acquire 
unique attributes required for use in this invention. It 
is well known that antibodies occur which are neutralizing 
and therefore by definition antagonistic in that they 
prevent, competitively or alios terically, the binding of 
signal to receptor, or receptor activity. 

Antibody epitopes in protein targets range from a few 
amino acids to about 20 amino acids and cover from 
hundreds to thousands A? of target surface. In addition, 
epitopes can comprise sequential or noncontiguous groups 
of amino acids. However, it is equally clear that 
antibodies can recognize organic epitopes which are 
relegated to much smaller volumes, (i.e., <50-200A 2 ) as 
are those associated most frequently with small organic 
haptins (i.e., dinitrophenol or morphine). As antibody 
affinity and selectivity can be equal, with" both large and 
small epitopes, it is assumed that ant i- target rVab 
antibodies will have landscape recognition surfaces which 
range over all of these dimensions. 
A. use of rVab Libraries 

The repertoire of different ligands for scanning the 
pharmacological target according to this invention is 
provided by an antibody library comprising recombinant Fab 
fragments, or portions thereof, constructed to present a 
sufficiently large repertoire of different identifiable 
structures, some of which will be expected to bind and, 
depending on whether concurrent binding to multiple sites 
is required, activate the pharmacological target. These 
^ active antibodies are identified as specific members of a 
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library which may be considered to scan the entire surface 
of the pharmacological target and possess the desired 
composite activity profile for the binding site. 
According to this invention, the recombinant antibodies 
used with this invention are referred to as "rVab" to 
indicate that they are constructed using recombinant 
techniques and are made as libraries which incorporate 
diversified amino acid sequences in one or more regions of 
the antibody associated with target recognition or 
binding . 

Where the pharmacological target comprises multiple 
binding sites on one molecular entity, or requires 
oligomerization of at least two molecules to form a single 
binding site with contributions from the individual 
subunits, or requires oligomerization of two or more 
molecular entities which each bind to the ligand at a 
different site, activity will only be observed using 
antibodies modified according to this invention to contain 
at least one additional separate binding entity. In the 
preferred embodiment of this invention, the separate 
binding entity comprises at least one random sequence of 
amino acids having a structure appropriate to bind a 
binding site not bound by the antibody's variable region. 
In some cases two such random sequences of amino acids 
would be required although it is contemplated that 
additional sequences may also be required. Additional 
binding sites on rVab can also be provided by more or less 
complicated protein based structures including smaller 
peptides, larger proteins including intact enzymes or even 
another antibody, in structures described in the 
literature such as diabodies (Winter et al . 1994). 
Additional peptide sequences which may be used to add 
additional binding sites preferably are between about 5 
and 30 amino acids in length. More preferably such 
sequences are between about 6 and 12 amino acids in 
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length. Most preferably, such sequences contain 8 amino 
acids . 

An antibody identified as recognizing the binding 
site is simultaneously or sequentially further 
characterized by determining its selectively and activity 
5 for the pharmacological target. To streamline an rVab 
selection process for more than one target attribute, 
target specificity (T) and some of the activity (A) 
testing may be simultaneously characterized. 

The order of isolating Vab for A* and S + can be 

10 varied, most often depending upon which is the more 

difficult attribute to find among entities which modify 
the target of interest. For example, if selectivity among 
highly homogeneous target members of a single family is 
the critical missing attribute of existing agents, S+ 

15 could be determined first, or after isolation of the 
population which is A + . 

Although antibodies which recognize (i.e. bind) the 
target's landscape in such a way as to modify its function 
make up a small percentage of those capable of passively 

20 recognizing the target (i.e. not modifying its activity), 
their presence is likely because of the size and diversity 
of the rVab library of the invention. In addition, active 
antibodies would also be expected to be present which have 
the additional desired attribute of specificity for that 

25 target. Furthermore an embodiment of this invention 
includes that the biological suitcases (i.e, phage or 
bacteria) used to individually package each rVab library 
member allows their recoverability after a biological 
replicative cycle even if present in the original library 

30 in rare copies. 

This invention utilizes recent advances in molecular 
biology which allow the generation and manipulation of 
sufficiently large and diverse V (VH and/or VL) region 
libraries, along with both minimization and directed 

35 secondary diversification of their CDRs and CSRs to allow 
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selected rVabs, when labelled, to act as reporters and 
affinity selectors in assays which identify potential 
active ligands. Such active ligands are preferably small 
organic molecules which are useful as drug leads or as 
therapeutics themselves. 

B. Use Of rVAB Members To Identify Small Organic 
Molecule Replac ements (SOMERS) 

At least two methods are provided for identifying 
SOMERS based on the identification of the recombinant 
10 antibodies (rVab's) possessing the attributes of T 

(specificity/potency) S (selectively) and A (activity) . 
According to one method of the invention, these [rVab T+ 
S+ A+] scanners are converted to reagents for reporting 
the presence of other ligands capable of binding to the 
15 active site on the pharmacological target. Conversion to 
reporters is accomplished by labelling the active scanners 
with a detectable label. The reporter rVab fragments may 
then be used in classical competitive binding assays to 
identify SOMERS. For simple active surfaces, single 
SOMERS represent active small organic molecules, while for 
complex active surfaces containing more than one ligand 
binding site, corresponding numbers of SOMERS, found in 
the fashion disclosed by this invention, are covalently 
coupled together to represent the active small organic 
molecules . 

In another embodiment of this invention, SOMERS are 
identified based on the collective attributes of an 
ensemble of active rVab scanners which have been 
characterized as T+, S+ and A+. By providing a 
sufficiently large repertoire of antibodies, multiple 
antibodies possessing these desired binding attributes are 
expected to be identified. Common structural features of 
this ensemble of scanners possessing the desired CAP 
attributes are then used to construct a model ligand for 
binding to the active surface of pharmacological target. 
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By combining structural features of multiple antibodies 
identified as being active for a specific pharmacological 
target, biologically enhanced ensemble pharmacophores 
("BEEPS"), i.e., drug models, may be derived which may 
then be used to identify small organic molecules as drug 
leads or therapeutics. This molecular model, BEEP, then 
serves to provide a basis for screening chemical databases 
to identify SOMERS either by electronic screening of 
available chemical data bases, or as a basis for rational 
drug design to synthesize SOMERS expected to possess the 
combined attributes of specificity/potency, selectively 
and activity. This solves the prior art problem of access 
to all compounds within a chemical data base, decreases 
the time needed for screening and amount of manpower 
necessary, and could eliminate screening if used to direct 
a synthetic chemistry effort to create SOMERS. 

III. Recombinant Antibody Libraries Provide Sufficiently 
Large Repertoires Of Different Ligands To Identify 
Active S urfaces — 

A. Function Of Recombinan t: Library 
The objects of the invention are provided by a 
process which makes and then uses separately and/or in 
batch mode, combinatorial repertoire libraries of variable 
regions (VH and/or VL) of recombinant antibodies (rVab) to 
scan the surface of a pharmacological target so as to 
identify and select those which have a desired potency, 
specificity, selectivity and activity profile. These four 
attributes are collectively defined herein as the compound 
activity profile (CAP) . Members of the library possessing 
the desired attributes are then grouped according to the 
local surface domain recognized. By using a sufficiently 
large and diverse library as described herein it is 
expected that essentially most if not all relevant active 
surface of pharmacological targets should be identifiable 
using the method of this invention. In addition, because 
the library is recombinantly made in a random fashion and 




# 



WO 96/04557 



PCT/US95/10182 



- 36 - 



selected in vitro, recognition of sites which would not 
otherwise be detected as non-self , or antigenic, or 
immunogenic, should occur using the rVab library described 
in this invention. 

In addition, the objects of the invention relating to 
discovery of three dimensional shapes of surface areas are 
provided by use of the active rVab's of this invention as 
reporters of target structure. As described below, these 
rVab reporters are constructed using VH and VI* domains 
wherein the CDR regions which may be diversified are 
contained in a framework of an Ig (or Fab) having a three 
dimensional structure which has been determined by 
crystallography has CDRs which contain the known canonical 
structures (CSRs) . Such structural information about 
rVabs for a given delimited active target surface domain 
allows for the molecular resolution and deduction of the 
essential elements of the rigid organic structure of the 
constellation of critical amino acids constituting the 
active target surface recognition portions of the ensemble 
of active rVabs and thereby provide the essential elements 
of the rigid organic structure of active SOMERS which can 
bind with specificity to and modify that target. 

Construction of the BEEP requires PCR determination 
of the amino acid sequences of rVab CDR, CSR and some 
framework residues in these active surface scanners 
through a process which uses computers and genetic 
algorithms. It is also possible that with sufficiently 
large enough active rVabs, the information obtained in the 
above manner will enable resolution of the active surface 
of the targets. This process provides the objects of the 
invention related to electronic screening for SOMERS by 
combining common structural elements in computational 
packages called biologically enhanced ensemble 
pharmacophores (i.e., BEEPS ) . 

The recombinant antibodies used in accordance with 
this invention also provide an improvement over the prior 
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art of typical labelled target -reporter binding assay 
screens. One improvement comprises obtaining via 
recombinant molecular biology technology, antibody 
variable regions (V) in sufficient numbers, with 
sufficient affinity and desired activity so as to identify 
those members of the library which function as surface 
reporters capable of recognizing active target surfaces, 
modulating the target through these recognition sites and 
distinguishing its target from among closely related 
targets (selectivity) . 

B. Size Of Recombi nant Library 
In order to have a sufficient likelihood of 
identifying the active surface of a pharmacological 
target, the recombinant library preferably contains at 
least between about 10 9 and 10 u entities. Preferably the 
library contains between about 10 10 and 10 13 entities. Most 
preferably, the library contains about 10 12 entities. The 
specific size of the library required to provide a 
reasonable likelihood of identifying the active site will 
depend on the overall surface area of the target surface 
and the surface area of the binding domain to be 
identified. The surface of most targets is of the order 
of 50,000-100,000 A?, with each ligand binding domain 
encompassing from about 100-200 A 2 to about 1,000-2,000 A 2 . 
As each rVab covers only about 20-40 A 2 of surface area, 
one requires about 2,000 rVab's to cover the target 
landscape, and at least 10 times that (2xl0 4 ) allowing for 
overlapping recognition domains. Another increase of two 
orders of magnitude (2xi0 6 ) allows for appropriate surface 
interactions which produce specific agonist or antagonist 
action. Another 100 fold increase allows for such rVabs 
to be recoverable from the library upon batch analysis. 
An additional 10 4 -10 4 fold increase allows nanomolar 
affinities and agonistic activity. Accordingly, the 
preferred useful surface scanning libraries have on the 
order of about 10 12 entities. 
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It is recognized that antibodies have the ability to 
distinguish among closely related targets. Accordingly, 
recombinant libraries possessing sufficient numbers of 
entities are reached according to this invention by 
constructing recombinant libraries comprising variable 
regions of either, or both light (L) or heavy (H) chains 
which are modified or unmodified and which may or may not 
be expressed in combination with a constant region. These 
libraries may be selectively varied not only during their 
original construction, but also after the initial round of 
selection for any one or all of the three composite 
profile activities of target binding, selectivity and 
activity. Such secondary additional diversification as 
well as secondary simplification may be carried out by 
combinations of primer based PCR or oligonucleotide 
insertion at convenient restriction sites. Furthermore, 
the secondary variations may be localized to each of the 6 
CDRs (i.e. the three in VL and the three in VH) or any 
particular combination or singular location. Variability 
is introduced in the CDR^s by modifying the CDRs to 
contain random amino acid substitutions of positions 
involved in contact with the target. The positions of 
variation, including further diversification or 
simplification, are preferentially those within the CDR 
which do not alter the CSR structure of that region and 
are known to those skilled in the art. The number of 
amino acid positions to be diversified is dependent on the 
number of active rVab members desired to be obtained. 
Thus, if an insufficient number of members are identified, 
the library diversification can be increased by 
diversifying additional amino acid positions in a CDR as 
described below. 

Given that there are twenty naturally occurring amino 
acids, diversification at a single amino acid position 
results in about 2 0 different potential antigen binding 
35 (touch) sites. By diversifying at two amino acid 
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positions in each of the 3 VL CSR, 2 VH CSR f and the one 
VH CDRH3 which are randomly combined into VH:VL pairs by 
the invention, one obtains a diversity in the rVab library 
of *10 u members (see Pig. 4). Since a given phage library 
can package about 1X10 14 members, several libraries are 
5 preferably constructed and packaged in phage to contain 
the entire population of diversified members. Although, 
it is preferable to diversify two amino acids in each CDR 
as shown in Figure 6, other combinations are possible. 
Randomization in only some of the GSRs and one CDR allows 

10 f or library sizes approximating 10 12 such that one phage 
rVab library could contain multiple copies of each 
diversified member. In addition, three or more non- 
essential amino acids in a given CDR may be diversified 
(see. Figure 5 for non-essential amino acids) preferably 

15 with a corresponding decrease in diversification of amino 
acids in other CDRs so as to maintain the total size of 
the library within an attainable number. Resultant 
libraries of 4xl0 12 members can be approached using, for 
example, bacteriophage as vectors. A single rVab library 

20 of this invention of at least about 10 n members, 

independently of how diversity is obtained, provides 
enough surface probes with the minimum CAP at the target 
to allow identification of most active surfaces of 
interest . 

25 An advantage of this invention over prior art 

screening methods is that it scans the entire available 
surface of the target for active surfaces and provides 
active surface reporters. This allows for identification 
of active sites and SOMERS for targets without endogenous 

30 signals (endogenous ligand) and at target surfaces not 
used by natural endogenous ligands but which result in 
modulation of that target. The latter surfaces, referred 
to as allosteric surfaces, are of two types: those without 
activity in the absence of endogenous ligand binding to 

35 target (i.e., cryptic allosteric sites); and those having 
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activity on their own and yet are still able to modify the 
action of ah endogenous ligand (i.e., active allosteric 
sites) . Obviously, the larger the target surface under 
scrutiny, the greater the opportunity of finding 
appropriate active surfaces. As endogenous ligand contact 
surfaces probably represent some 10% of total target 
surface area, including allosteric surfaces greatly 
increases the surface area under investigation. 

The use of recombinant libraries also provides a 
means of reducing or increasing the number of 
complementary determining regions (CDR) within the 
variable domain of the rVab necessary to confer desired 
CAP attributes to the rVab. Thus, one can attain a 
minimal active CDR complement. Alternatively, large scale 
randomization of up to most of the amino acids within the 
rVab CDRH3 domain may be used to increase the population 
of active rVab from which to identify the best rVab 
reporter. For example, if the initial library screened 
does not possess members with the sufficient constellation 
of CAP attributes, secondary diversification of the best 
candidates, by a number of procedures including PCR and 
various In vivo and in vitro mutagenesis systems known to 
those skilled in the art, and then recycling through the 
original identification and selection procedures described 
below, may be used to recover an antitarget rVab with a 
full complement of the desired CAP which might have been 
too rare to be found among the original antitarget rVab 
library. In addition, by identifying and sequencing 
active rVab CDR complements one may also obtain accurate 
and detailed structural information useful for modeling 
the essential elements of active SOMERS, i.e., as in 
BEEPS . 

C. Affinity Of Recombinant Antibodies 
The rVab of the invention are used to detect and 
characterize active sites by providing information related 
to their structure, and/or to function as reporters in 
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competition assays to identify SOMERS. Accordingly, the 
affinity of the rVab's useful in this invention should 
allow for one or both of these functions. If the rVab is 
used only to detect and characterize the active binding 
site or to contribute in developing a BEEP, its affinity 
may be high and a slow dissociation rate (i.e., half time 
of dissociation, preferably between about 5 and 30) would 
be suitable. However, the affinity of the rVab's useful 
to identify SOMERS for a pharmacological target should not 
be so high as to prevent dissociation and competition for 
use in competition assays. Preferably, this affinity will 
be in the range of from about 0.01 to about 100 nM. More 
preferable the affinity will be between about 0.1 to about 
30 nM- Even more preferably the affinity will be about 0. 
5-10 nM. Most preferably, the affinity will be between 
about 1-5 nM. 

D. Characterization of Ligand And Target 
Binding Sites 

Binding domains on a signal are referred to as ligand 
20 attachment sites (LIGATTS) and those on the target as 

target attachment sites (TARGATTS) . Where each is protein 
in nature, both can be defined as the surface area of the 
entity made up of contiguous (e.g., amino acids n and n+1) 
or discontiguous (e.g., amino acids n and i where i is not 
25 n+1) elements so confined in space as to be accessible and 
in contact at the same time with the surface of the other 
partner in the complex so as to contribute to the binding 
energy of that interaction. Where there are multiple 
binding sites, by our definition, . each TARGATT domain 
30 forms contact points with amino acids on the signal and 

one SOMER would not be expected to encompass two LIGATTS. 
Where endogenous ligands are nonproteinaceous, other 
compound building blocks would replace the amino acid as 
the unit entity. 
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The sizes of LIGATTS and TARGATTS are quite variable. 
We have arbitrarily confined TARGATTS to the volume which 
can be encompassed by a synthetic small organic molecule 
replacement (i.e., SOMBR) of less than about lkD. This 
TARGATT size, is practical and modeled by the opiate 
receptors' attachment site for its 30 amino acid 
endogenous ligand, endorphin, which easily binds morphine 
(<600D) and all of the pharmaceutically known opiate 
analgesics, with nM affinity and is fully activated by 
their attachment. Identification and characterization of 
larger TARGATTS is considered within the scope of this 
invention as such sites should also be recognized by 
members of rVab libraries of this invention. 
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E. Association of Activity A+ with Bin ding of rVab 

■ An important feature of this invention is that the 
rVab's which are identified as possessing the desired CAP 
attributes and in particular, activity at a target, 
function to create a linkage between binding to a target 
and activity at that target. Accordingly, once an rVab is 
20 identified which is both T + and A + , that rVab may then be 
used to identify other ligands which are also T + and A + 
based on competition binding assays alone. 

Several methods are available to initially provide a 
connection between binding and activity of a rVab. In a 
preferred method, an active surface for a target is 
associated with a secondary biochemical response which may 
be detected upon binding of an active ligand at the active 
surface. Such biochemical responses may include changes 
in affinity of the ligand or allosteric ligands, 
oligomerization with other subunits, phosphorylation 
state, ion flux, etc. For example, and as discussed more 
fully below, the changes in agonist affinity of a receptor 
coupled to G protein based on the presence of a guanine 
nucleotide can provide the necessary linkage between 
35 binding and activity. 
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Also, as discussed in U.S. Patent 4,859,609, which is 
incorporated herein by reference, receptors may be 
expressed as fusion proteins comprising the ligand binding 
domain of the receptor fused to a "reporter- polypeptide 
which undergoes an assayable change in conformation or 
5 function when the active ligand binding domain of the 
receptor binds to an agonist or antagonist. 

IV. Method of Identifying SOMERS 

The method of obtaining small organic molecules 
10 (SOMERS) which are active at pharmacologic targets is 
summarized as comprising the following (See FIG. 1) : 

snaae I (a) : Construction of the scanning rVab 
library. 

Stage I (b. c) : Identification of rVab's which bind 

15 and activate target. If target is a multivalent site 
requiring attachment at two sites, pairs of rVab's are 
identified using rVab-peptide scanners to detect activity. 

Stage II : Use labelled rVab's as reporters to detect 
SOMERS or MULTIMERS (i.e., DISOMERS) . 

20 Stage III : Create BEEPS from composite of structural 

information derived from rVabTSA* for screening or 
synthesizing SOMERS or MULTIMERS. 

A. Construct ion of Scanning rVab Library (Stage la) 
Molecular biology technology is used to construct a 

25 limited number of large combinatorial libraries of 

recombinant antibodies (rVab libraries) wherein the VL and 
VH CSRs and CDRH3 occur within each library within a 
single Ig VH and VL framework, respectively, and 
optionally attached to their respective constant region 

30 (CHI and CD . An antibody whose structure has been 
determined by crystallography is preferably used to 
provide the framework for construction for these rVab 
libraries. Antibodies of undetermined structure can also 
be used for library construction and identification of 

35 active rVabs (i.e.. Stage 1 abc. Fig. l) useful as 
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reporter rVabs to detect SOMERS and other MULTIMERS (Stage 
II Fig. 1) according to the process of the invention, but 
only antibodies of determined structure can be used in 
creation of BEEPS (Stage III, Fig. l) . 

In the preferred embodiment of the invention, 
antibodies of solved structure are used to create the 
original rVab library. In another embodiment, one or two 
of the isolated active rVabs for a given target are 
subsequently crystallized and the structure determined to 
allow their use in Stage III. The later is useful as it 
allows use of the newly published sequences of the human 
VH and VI* genes [Tomlinson et al . 1992; Williams and 
Winter 1993; Cox, Tomlinson and Winter 1994; Nissim et al . 
1994; Tomlinson et al . 1994] for Stage III work. 

In all cases, the rVab libraries constructed by the 
process of the invention have a sufficient number of 
diverse members to encompass an immunological antigenic 
repertoire approaching man's natural one or are made from 
human VH and VI* genes [Roitt, 1991; Nossal 1993; Griffiths 
et al., 1994] which are capable of recognizing an enormous 
diversity of surfaces including but not restricted to 
proteins, nucleic acids, carbohydrates, lipids and organic 
haptens . 

There are basically three sources of genes to be used 
as the starting material for construction the rVab 
1 ibrar ies . 

a) the published data on cloned and sequenced 
antibodies; 

b) the antibody clones themselves, carried in 
various cell types, including hybridomas, spleen cells, 
bacterial plant cells, yeast and viruses, on various DNAs 
including plasmids, phagmids and chromosomes; and most 
recently 

c) the published sequences of a human repertoire of 
VH and VL genes [Roitt, 1991; Tomlinson et al. 1992; 
Nossal 1993; Williams and Winter 1993; Cox, Tomlinson and 
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Winter 1994; Griffiths et al., 1994; Nissim et al . 1994; 
Tomlinson et al . 1994]. 

Most of the sequence information is available in at 
least two data bases, i.e., the Brookhaven Protein data 
base and that of Kabat at NIH (which is also available in 
text form) [Kabat et al. 1991]. The structure of the 
majority of the crystallized antibodies is also available 
from the Brookhaven Protein data base. Listings of such 
crystallized antibodies are presented in Example 1. An 
example of an antibody which has been crystallized to 
determine its structure is described in (Tulip et al. L 
Mol. Bio. . (1992) 227:149-150). 

In the preferred embodiment, the antibody sequence is 
obtained first and is the starting point of rVab library 
construction using the following steps to construct the 
15 rVab library. The order of steps may be varied to suit 
particular circumstances. 
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I. Selection of Parental Fabs of known crystalline 
structure as rVa b library framework templates 

II. Creating the Nucleic Acids Encoding the Heavy 
and Light Chains (rVHCHl and rVLCL) for ABXXX 
rVab.lib. 

Step 1 (a) : Construct ion of 5.'VL Section 

Step 2: Diversification By PGR 

Step 1 (b) : Construction of the MIDVL section 

Step 1 (c) : Construct ion of the 3'VL section 

of rVL 

Step 3: Ligation 

III. Construction of the Constant regions of ABxxx 

IV. Construct ion Of rVHCHl.lib (Fig. 8) 
Construction of 5' ha lf of the VH Region 
Construction of the 3' Half of the VH Region 



35 



V. VH and VL libra ry sizes: 
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VI. Construction of the rVab.lib (the VHCHllib x 
VLCLlib combinatorial lib.) (Fig . 11 . 12 . 14) 

Step 4: In vivo recombination of VHC Hi and VLCL 



Details of the Individual Steps for Expressing 

the rVLCL.1.6 and rVHCHl.L.b by CRE-LOX 

RE COMBINATORIAL FORMATION 

VI. Step 5 - Generating Phage and Displaying the 
rVab.lib on Phacre Surfaces (Fig. 14) 

The critical steps are shown in Figs. 7, 8, 11 and 14 
which describe respectively the construction of rVLCL and 
rVHCHl libraries, their pairing in the rVab library, and 
finally their expression attached to the surface of phage 
as functional complexes. 

Both construction of the rVLCL and rVHCHl libraries 
follow a similar outline wherein: 

a. a limited number of oligonucleotides are 
synthesized containing convenient restriction sites and 
which cover both ends, and in one case the middle domain, 
of the V region, 

b. the oligonucleotides are ligated together, 

c. PCR is used to append missing and junctional 
regions as well as provide the means of randomization of 
amino acids at defined positions, 

d. the completed rVH and rVL libraries are ligated 
to appropriate constant domains wherein one- library is 
placed within a plasmid and the other phagmid, and 

e. the rVH and rVL libraries- are combined in vivo 
by the CRE-LOX recombinase provided by coinfection by PI. 

Following this outline, rVab libraries of about 10 12 
members are constructed. 

In other embodiments, 

a. the VH and VL genes, without constant regions, 
encoding an antibody of known structure are cloned via PCR 
to obtain the sequences encoding the VHCHI and VLCL 
sections of the lgs using methods known to those in 
skilled in the art, and 
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b. the Vs may then be altered via PCR to remove 
unwanted restriction sites, and develop convenient 
restriction sites bording the CSR and CDR domains. 

c. selectively randomized oligonucleotides with 
appropriate end positional restriction sites may be used 
to replace each of the 6 CDR regions having appropriate 
matching restriction sites in the basic V framework to 
allow directional cloning. These oligonucleotides vary in 
length (i.e., n, n+1 and n+2) to match the known CSR and 
some length changes in CDRH3 and contain all of the amino 
acids at one or two positions within each CDR most often 
involved in antigen contact. 

In the preferred and other embodiment, with 2 amino 
acid randomizations within each CSR and CDRH3 and three 
different lengths of CDRH3 used, the numbers of diverse 
members in the final rVab LIB (i.e., rVHCHl x rVLCL) reach 
10 18 (see Fig. 4 for details) . 

1. Sources of Frameworks 

Frameworks in which the optimally diversified CSRs 
and CDRH3 are cloned into may be derived from antibodies 
of known structure. 

Frameworks may be chosen from antibodies which 
present the canonical regions in different orientations 
with respect to the C region. Thus, it may be desirable 
to prepare multiple rVab libraries on different frameworks 
to maximize different special orientations of the CDR's. 

Frameworks may be chosen which will favor binding 
over small to large surface areas. As discussed above, a 
small surface area would cover an area of about 200 A 2 , a 
medium surface area about 750 A 2 and a large surface area 
about 1500 A 2 . Examples of antibodies which can provide 
frameworks for these three different size targets are 
found among the planar, cavity and grooved type antigen 
recognition domain present in various antibodies of known 
structure (Fig. 3 respectively). Frameworks may be chosen 
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simply based on the shapes of the antigen recognition 
domain or in combination with other structural factors. 

2. The Expressible Vab Region Construct: 

Preferentially, construction may be done in one of 
two general type vectors, 

a. fd and M13 (Pharmacia, USA [Smith, 1985; Scott 
and Smith, 1988; Parmley and Smith 1988; Cwirla, et al . , 
1990, McCafferty, et al . 1990; Winter and Milstein 1991; 
Waterhouse, et al. 1993, Recombinant Phage Antibody System 
Instruction Manual, Pharmacia P-L Biochemicals , USA] . 

i. the inserted V(H and L) with CHI at the 
carboxy terminus preceded by the lac promoter and a 
ribosomal binding site [RBS] , an export leader sequence in 
front of gpIII phage coat protein or PelB, a cloning site 
followed by either an in frame linker and then gpIII, or a 
double set of suppressible termination codons. 

ii. the VH or VL without CHI or CL or with 
partial NH2 terminal constant region amino acids may be 
preceded by the lac promoter -RBS-PelB-with internal 
cloning sites allowing in frame ligation of VH at both 5' 
and 3' ends and followed by -C(H or L) and either an in 
frame linkage to gpIII or two suppressible termination 
codons . 

b. immunozapll (lambda) Stratacycte, CA, [Skerra, 
and Pluckthun 1988; Mulinax, et al . 1990, ImmunoZap 
Cloning Kit, Instruction Manual Stratacyte Corp. CA USA; 
Kang, et al. 1991; and Barbas, et al . , 1991J . 

i. as above for V region, with and without 
intact CHI. 

ii. as above for V region, with and without 
intact CHI. 

Expression of Single V(H, L) -C(H or L). 

Expression of single V(H or L) -C peptides may be used 
to confirm proper construction of the V regions, or rVHCH 
35 and rVLCL libraries, before either expression as mature VC 
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(rVHCH or rVLCL) or CRE-LOX recombination and phage 
expression. fd (M13) or Lambda expression is induced with 
glucose as described in Pharmacia (USA) Kits or the 
Stratacyte <CA) system Lerner. The product may be 
identified with CHI antibody (standard Elisa technology 
known to those skilled in the art) either with fd as phage 
displayed molecules, or with lambda after expression 
induction, and generation of periplasmic located 
molecules. When using phage, the induction of the lytic 
cycle may also be used to determine the ratio of lambda to 
intact rV as an indication of size of library. With fd, 
one can assay antibiotic (e.g. ampicillin) resistance 
colony forming units (cpu) transfer from within fd genome 
vs. the number of phage with rV display attached to the 
viral surface. Dishes coated with viral or rV antigen may 
be used to provide information on the size of the rV 
library. 

In another embodiment, only the rVH and rVL domains 
are expressed and connected through a flexible linker to 
form a single chain V region antibody (termed scFv by 
Winter [Huston, et al . 1988; Bird, et al. 1988; 
McCafferty, et al . 1990; Hoggenboom, et al . 1991; Barbas, 
et al. 1991; Garrard, et al . 1991; Breitling, et al. 1991] 
which may be expressed using phage display. The expressed 
V antibodies are fused to gill on M13 using a Recombinant 
Phage Antibody System Kits (Pharacia, USA) , according to 
instructions provided the manuf acturerer for construction, 
expression and detection. 

c. General information on primer use and PCR. 

To allow the library construction of various domains 
of rVH and rVL, and CH and CL as well, each primer 
includes a sequence encoding a restriction endonuclease 
recognition site. The sequence of the primer which 
contains the restriction site may be located within, 
partially within, and sometimes precedes the section of 
the primer annealing to the target Vab sequence. When it 
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is present as an extension to the sequence homologous to 
the rV section under construction, it will not participate 
in annealing during first strand forward and second strand 
reverse synthesis but will participate in annealing 
subsequent PCR amplification cycles. Although not 
essential, the restriction sites (at either or both ends) 
are such as to generate 3' or 5' overhangs to aid in 
subsequent ligation utilizing restriction enzymes which 
maintain the appropriate reading frames. Products of PCR 
may be isolated from the reaction mixtures by a variety of 
techniques known to those skilled in the art. A number of 
restriction sites which have been successfully encoded 
within rVH and rVL gene constructs for insertion in the 
available expression vectors are known to those skilled in 
the art and are available from manufacturers of IG 
expression systems and Ig primers such as Pharmacia (USA), 
Stratacyte (CA) , and 5' -3' Prime (USA). 

Insertion in frame can be into vectors containing 
sequences encoding other proteins to produce fusion 
proteins not only containing one or more C constant 
regions, but also the coat protein gpIII and VIII of fd 
filamentous phage, or transmembrane proteins to provide 
rVHCH or rCLVL anchoring for appropriate extracellular or 
phage displays. 

3. Preparation Of rVabs With Multiple 
Attachment sites 

The grouping of active rVabs based on recognition of 

different target surface domains is simplified by using 

small peptides which cover in an overlapping fashion, the 

liner amino acid sequence of the target. Such grouping 

simplifies the pairing of active rVab for a MULTIMER (e.g. 

DISOMER or TRISOMER) obtained from multivalent rVab-PEP 

libraries (example 3 and 4) as well as forms the basis of 

selection of active rVab for conversion to reporters for 

simple SOMER identification. 
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Given that many antigenic sites are less than 12 
amino acids, peptides of 10-20 amino acids, made in 
overlapping fashion (i.e. amino acids 1-15, 5-20 10-25 
etc.) would provide most of the sequential target 
epitopes. This would mean that for an average protein of 
50,000 Kd, i.e., some 90 would be needed to cover the 
entire surface. For many pharmaceutical targets, 
mutagenesis and alanine scanning has provided information, 
known to those skilled in the art, of particular amino 
acids, and small groups of amino acids which are involved 
in signal binding and receptor activation. Such 
information is used here to reduce to a much smaller 
number the peptides needed to provide most of the desired 
surface epitope information. Another possibility for 
target fragmentation is the use of synthetic polypeptides, 
15 bought commercially or produced by biotechnology means, 

using commercially available expression vectors harboring 
specific sites for cloning and expression of peptides in 
fusion with easily and quantitatively recoverable 
proteins. 
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4. CSR And CDR Diversificat ion And Reduction 
CSR and CDR Randomization: A preferred embodiment 
will be to use synthetic oligonucleotides which vary at 
increasing number of amino acid positions within each CSR 
and CDRH3 but which do not alter the CSR. Minimal 
randomization of amino acids would be to have only 1 
position within each CDR filled with all 20 amino acids. 
One could include up to about 24 amino acid positions 
within the CDR H3 . As the number of positions randomized 
increases, the total possible different rVH and rVU 
rapidly exceeds the practical limitation of I0 n ' 14 on phage 
library size, and one has to limit the number to fit 
within the library size that is attainable. Increased 
randomization at larger number of positions can be 
accomplished by putting amino acids into classes, i.e.. 
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basic, acid, hydrophobic, hydrophilic, etc., and then 
using only one or two amino acids of each group at each 
'randomized' position. Secondly, since not every amino 
acid within a CDR is involved in contact, one can identify 
those which are most often involved in contact and focus 
amino acid randomization at those positions. Lastly, one 
does not need to use the same type or degree of 
randomization for all CSR and CDRH3s. In one embodiment, 
one could use only CSRH1, H2 and H3 for randomization as 
VHs alone have been published to have nM antigen affinity 
[Ward, E.S. et al. 1989]. 

In the preferred embodiment, randomization may be 
accomplished during construction of the rVab library. In 
addition, secondary randomization after isolation of the 
initial active rVabs may also be utilized if desirable. 
Secondary randomization can be used to obtain a single, or 
pairs of missing attributes of the desired TSA CAP, or to 
increase or decrease one or more present CAP attributes. 

CDR Reduction: To determine the smallest target 
binding domain it may be desirable to reduce the size of 
the potential rVab target binding domain. For CSR and CDR 
reduction there is the possibility of using only one VH or 
one VIj, making PCR copies, cloning with primers which 
include only the first, first two, or last one or two CSR 
and CDRs within rVH and rVL, and subsequently ligating the 
constructs into parental frameworks wherein the missing 
CSR or CDR has been replaced with a string of glycines 
(Winter EP 0 368 684 Al) . After alteration each library 
may be retested for its new CAP. In another approach, one 
can start with a preferred rVHrrVL pair and delete (again 
replacing each with a glycine heximer) a) one CSR or CDRH3 
at a time (there being 5 such possibilities) ; b) two at a 
time (there being 14 such possibilities); c) three at time 
(there being 9 of these) ; and d) four at a time (there 
being 6 of these) . With reduction in CSR and or CDRs, the 
potency of the altered rVab can be tolerated up to 30nM, 
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° (that required for use of the rVab in subsequent binding 
screens for organic replacements) . However, an affinity 
of 100 nM is tolerable in the minimal CSR/CDR combination 
if it is put through mutagenesis for potency improvements 
later on as such processes have been shown to produce 
5 increases in binding affinity of up to two orders of 

magnitude [Bass, Greene and Wells, 1990; Marks, et al. 
1991] . The reduction in number places all of the critical 
contact atoms within the smallest number of semifixed 
domains making 3D modeling of critical atomic spacial 
10 relationships easier by means known to those skilled in 
the art. 



5 . Express ion of rVabs 

a. Expressio n of rVah as a phage library , 

15 m one embodiment of this invention, the rVab library 

is displayed on phage. This process is best described 
recently by Griffiths et al. 1994). Methods for using 
phage display of antibodies have previously been published 
(see, Ladner et al.. International patent application 

20 W090/02809; Winter et al . W092/20791 and Huse et al . 

W092/06204 which are incorporated herein by reference) and 
some reagents are commercially available in kits. 

In another embodiment, only the rVLCL is placed in a 
library for expression as a bacterial plasmid construct 

25 (VLCL.bact) with a leader which allows product release to 
the periplasmic space. This library is then expressed and 
product is combined with either one of a rVHCHL or rVLCL 
phage displayed library to derive the two phage and one 
soluble protein libraries. An anti-CL antibody attached 

30 to solid matrix may be used to harvest their VLCL protein 
library. 

To identify members of the rVHCHl phage library with 
one or more CAP attributes, the soluble rVLCL protein 
library is added to the above phage library and panned for 
35 target surface recognition with target protein attached to 
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a matrix (plastic, chromatographic or magnetized beads) in 
the absence or presence of competing proteins (see example 
2) to derive rVHCHl : rVLCL protein T+(S+) members. The 
phage containing the rVHCHl gene is harvested after 
allowing the phage to multiply with helper (using 
commercial kits) . Isolation and enrichment steps may be 
repeated as required. This library may be referred to as 
T+ S+ rVHCHhal f LIB. Assays for A+ may be then be done to 
obtain TS A+ rVHCHhal f LIB . 

The T+ (S+A+optional) rVHCHhal f LIB may then be cloned, 
in, for example, lambda and expressed as periplasm soluble 
entities. The library may then be mixed with the phage 
display rVLCL - LIB , and the above isolation steps repeated 
to obtain a T+ (S+A+) rVLCLhalf LIB. The specific 
methodology for this procedure has been published by 
Lerner and group [Cabilly, et al. 1984; Burton, et al. 
1988; Huse, et al . 1989; Mullinax et al . 1990; Zebedee, et 
al. 1992; ImmunoZap Cloning Kit Stratacyte Corp. CA. , and 
Surf Zap Cloning Kit (instruction manual) Stratagene Corp, 
CA] and is herein included in entirety by reference. See 
below section on Functional VH and VL combinations for 
details. The active rVHCHhalf library is cloned into 
pVHACCEPTOR (see Fig. 11) and the active rVLCHhalf library 
into pVLACCEPTOR. The CRE-LOX recombination system may 
then be used to derive a rVab LIB combinatorial library 
which may be tested for the TSA+CAP. 

b. Isolation Of rVab Library Of Target Binders And 
Phfrge DjgplflY 

This step isolates all rVab existing within the 
original library which recognize some part of the target's 
surface and form a complex with sufficient stability for 
isolation (i.e, target affinity <30nM) . Those with this 
recognition ability are termed T + . In the preferred 
method, the rVab genes are mixed and packaged in and 
displayed functionally on phage surfaces. Accordingly, 
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rVab are displayed on the surface of phage and the phage 
are incubated with target surfaces. In other embodiments, 
the library number can be reduced by prior selection of 
active rVHCHhalf LIB and rVLCLhalfLIB which allows 
packaging and expression in bacteria, either as soluble or 
membrane anchored rVabs by methods known to those skilled 
in the art using commercially available kits (e.g. 
Stratacyte USA) following manufactures directions. 

As discussed above, the target can be any surface one 
desires to scam for recognition by members of the rVab 
phage library. Permissible incubation conditions, of 
which there are many known to those skilled in the art, 
would include those which do not disrupt the vehicle 
packaging the rVab, or inactivate rVab recognition of the 
target, nor prevent display of its target epitopes. In 
addition, in all cases the rVab: target complex preferably 
is one which is quantitatively separable from free Tr-rVab 
phage packages. 

After incubation of target and the rVab phage, there 
are many published methods for separation of complexes 
known to those skilled in the art which are all based on 
the principle of having the target tagged (denoted Tr*") 
in such a form as to allow its convenient quantitative 
separation from all reaction solutes. Preferably such 
tags are inseparable or act as labels to follow the 
target :rVab complexes through separation procedures. 
Among such preferred tags are matrixes such as agarose, 
magnetic beads and the surface of culture dishes. In 
these cases attachment of the target to the tag would have 
been made prior to incubation with the rVab library. 
There are also non-matrix target- tags which allow 
target :rVab complex separation from solute and unbound 
rVab. Among such tags are fluorescent compounds, (for use 
in fluorescent activated sorting), biotin, (for avidin 
directed sorting) and polyhistine containing 6 residues 
[his 6] (for metal chelate column chromatography) and very 
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small antibody epitopes which are known to those skilled 
in the art. 

Incubation conditions can be varied extensively. 
Variations in temperature, time, pH, buffer and media 
additives are all to be considered as those attributes 
which influence target :rVab complex formation and 
stability in manners known to those skilled in the art. 
The preferred conditions here are phosphate, MOPS , Hepes 
or Tris buffer at about neutral pH (6.8-7.2) with 1% BSA 
at room temp, for about 4-6, up to about 6-12 hrs. 

After formation of target :rVab complexes, any matrix 
bound rVab is separated from unbound free rVab. In the 
preferred embodiment, the target is attached to plastic 
culture dish surfaces, and one of any number of rapid 
procedures, such as panning, is used for separating free 
and target completed rVab. The general approach of 
panr.ing at different temperatures, pH and the presence of 
the antigen have been shown to allow isolation of rVab 
with controlled affinity. 

After detachment from matrix or affinity associated 
tag, by procedures such as low pH or others known to those 
skilled in the art, the recovered rVabT* can be recycled 
through the selection procedure or any variant thereof any 
number of times. Published panning and affinity 
chromatographic procedures have shown single step 
enrichments of 5 x 10 2 -10 3 per cycle. Although, the 
number of cycles can be varied, depending upon the 
enrichment found per cycle, the abundance of a particular 
rVabT*, the total size and diversity of rVabT* recovered, 
3 cycles is preferred. Other number of cycles may be 
chosen based on recovered rVabT* characteristics such as S 
or A. 

Isolation of rVabT* members can be done with 
different types of packaged rVab expressing functional 
rVab, including phage packages or soluble entities as 
discussed earlier. 
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In this preferred isolation step, the rVab can have 
one of the following functional forms Fv(rVCvh or vl 
only) , Fab, or scFv as described below: 

a. Single functional VH or VL without (Fv) or with 



associated constant regions (Fvc) for the V heavy (CHn) 
5 and Vlight (CL or k) genes or parts thereof. Both types 
of F can recognize targets using only three of the six V 
region CDRs present in a natural Fab. In the preferred 
case, these Fvc genes are first packaged in fd phage and 
expressed with the C region (or some part thereof) 

10 attached, and in frame, being respectively a CHn, CL kappa 
or CL lambda) in which in all cases the constant regions, 
are devoid of their C terminal cysteine. There are a 
number of CH regions available including CH gamma, or 
delta, selected based on the required solubility and known 

15 to those skilled in the art. These V(or VC) genes can be 
expressed as soluble entities with or without tags or, as 
in the preferred case, fused, in frame, to one of the 
phage's coat proteins (i.e., gpIII) for functional 
display. These libraries comprising only V regions are 

20 termed rFv and may be expressed packaged in phage for 

phage display. rFv phage libraries may be screened for 
members possessing CAP attributes of T, S and A and may be 
further diversified as described above. Such libraries 
with entities containing a reduced number of CDR or CSRs 

25 may be derived as part of the secondary simplification 

process when there are a very large number of active rVabs 
or when simplification is desired to foster the 
development of a more accurate BEEP. 

30 c. Functional VH and VL co mbinations (rVab) : 

These combinations have two V genes with, or without, 
partial or intact constant genes. Although they may 
contain like members, .the preferred combination is one VH 
(or VHCH) and one VL, (or VLCL) . In the preferred method, 

„ rVabT* with the particular VH and VL couple are co- 
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packaged in a single phage, on a single piece of DNA, as 
two individual gene products. For each phage, either VH 
and VL, may be expressed as soluble protein and the other 
attached to gpIII to cause surface phage display of the 
Fab. This coupling and expression of VH and VL can be 
made with or without identifying separately the VH phage 
library and VL phage library which can recognize the 
target when in the presence of a library of soluble VLCL 
protein or VHCHl protein respectively (see, supra) . 

In one embodiment, the sequential procedure to obtain 
functional rVabs is as follows: Three individual libraries 
are made. Two of >10 7 phage packages each expressing and 
containing only one V gene (VHCH or VLCL) attached to 
phage for surface display. The other is of the same size 
but is made of VL genes expressed in lambda as soluble VL 
proteins which can be harvested from periplasm of bacteria 
expressing the VL soluble library. First rVabs are then 
made by mixing the soluble protein library with the VH 
phage library in solution prior to testing for target 
recognition. This mixing allows all VL proteins present 
to complex with any one VH expressed on a single phage 
surface package to form a phage attached noncovalent 
(disulfide bonds excluded) functional rVab. This allows 
the formation of all possible rVab combinations. To this 
mixture is then added the matrix associated target under 
study and after incubation, and complex formation, all 
phage carrying a matrix associated target displayed as 
part of rVab displayed on the surface of phage are 
isolated, preferably by one of the above noted panning 
procedures . Subsequent isolation of phage DNA gives an 
expressible library of functional rVHCHl phage which can 
be T+ (i.e., the T*rVHCHhalf LIB) . 

All the VH phage library inserts, before or after 
phage amplif i cat ion as needed, are next excised via simple 
endonuclease restriction digestion, and directionally 
cloned into a lambda able to express inserts as soluble 
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periplasm proteins- After induction as noted above, the 
protein from the T + rVHCHhal f L IB is harvested from the 
periplasmic space to give a protein library with the 
potential to form complexes with the entities within the 
other original phage library, i.e., the rVLCL library. 
After mixing the soluble protein library and this phage 
library, as above, the T + rVLCLhalf LIB is isolated, as 
noted above. 

In the final step of this embodiment, a combinatorial 
library of packaged pairs of T + rVab is produced in which 
individual packages contain one VH and one VL pair of 
genes co- expressed as separate entities but associated 
together in functional rVab complexes. In the preferred 
embodiment of this procedure, these two genes are combined 
via the CRE-LOX recombinase system reported originally by 
15 Hoess [Hoess, et al . 1982, Hoess and Abremski, 1985; 

Hoess, et al. 1986] and recently by Griffiths et al . 1994, 
which are included herein by reference. In another 
embodiment, the package is also a phage, and expression is 
similar to the preferred embodiment but in this procedure, 
the combinations of rVHCH and rVLCL are made by excising 
and ligating in vitro the DNA in a fashion which allows 
randomization of VH and VL pairs but only one pair per DNA 
construct. These constructs can be phagmid or phage to 
allow either bacterial or phage expression of the rVab. 
In bacteria the rVab are isolated and tested by protein 
lifts, whereas in phage, the rVab is attached to a surface 
protein for display and assay. Both methods have been 
published [Hoogenboom, et al. 1991; Rang, et al. 1991; 
Waterhouse, et al. 1993; Figini, et al. 1994; Jespers, et 
30 a i^ 1994] and are commercially available in kit form (e.g. 
Stratacyte, CA) . The preferred method is phage display of 
rVab. 

The advantage of the embodiment in which active 
rVxhalfLIB are identified before combining them into 
35 rVabs, is that where combinations of VH and VL are made 
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randomly from a preselected T+ active rVhalf library, the 
independent preselection of active VHCH T + and VLT + genes 
is likely to have reduced the number of active rVhalfLIB 
members to less than 10 5 "*. This reduction in number 
greatly increases the chances of deriving within a single 
phage library of 10 12 members, which is attainable with the 
methodology disclosed herein, all possible active rVabs. 

The procedure used to isolate single VH and single VL 
and pairs of VH/VL which recognize the target has the 
added benefit of being rapid, and controllable as to the 
strength and nature of Vab target binding that is desired. 
By the procedures outlined, a paired rVabT + (containing s 
about 10 10 entities) can be generated. 

The procedures discussed above result in the 
isolation of a) rVH or rVL, which alone do not need the 
other to recognize the target, and b) the recombinantly 
derived combinations of rVH and rVL termed rVabs and scFv 
which, in the later case, have rVH and rVL linked together 
by a short peptide chain and expressed as gpIII phage 
protein fusion products or even as soluble entities. 
Additionally, rVab in which both V domains are of one 
type, i.e., either VH 2 or VL, 2 are possible by this 
invention. VHVH Fab have been reported with increased 
solubility. Altering CHI for CH delta regions or changing 
specific and identif iable C amino acids, could also 
facilitate expression of novel rVabs. 

The basic and preferred technology for cloning 
individual heavy and light chain variable regions either 
alone, or attached at their N terminus to leader 
sequences, or parts thereof, or at their C terminus 
attached to a constant region, or parts thereof, and 
placement into suitable expression vectors, transf ormation 
and expression in a compatible host cell in active form by 
recombinant DNA technology are described in the art. See, 
Huse WO92/06204; Ladner WO90/02809; Winter WO92/20791, 
which are incorporated herein by reference. 
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To achieve high yield and faithful cloning of each 
active IgG, secretion of protein either as soluble 
extracellular protein or in the periplasmic space is 
suitable. In addition, protein may be expressed as an 
extracellular (or on the surface of phage) facing 
transmembrane or membrane -anchored functional protein 
which allows spontaneous dimerization of heavy and light 
chain intact IgG or V domains. 

Methods of cloning from naive or immunized animals, 
entire spleen repertoires of Vab heavy (Vabh) and Vab 
light (V AB 1) in their natural or random pairings to derive 
enormously diverse combinatorial repertoire libraries are 
known in the art. [Huse, et al. 1989; Sastry, et al. 
1989; Milstein 1990; Clackson, et al. 1991; Marks et al . 
1991; Winter and Milstein 1991; Hawkins and Winter 1992; 
15 Hoogenboom, et al. 1992; Lerner, et al. 1992; Marks et al . 
1992; Winter, et al. 1994] . 
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B. Identification of rVab's Which Bind 
And Activate Tar gets (Stage lb) 

In a preferred embodiment of the invention, pairs of 
VH and VL antibody domains (rVab) are selected both as 
biological scanners of specific target surfaces and 
information reporters of activity related to the molecular 
3D structure of the antibody site involved in surface 
interactions as well as the molecular 3D structure of the 
active elements of the binding site. This structural 
information is relevant to identifying the minimum 
structure of the LIGATT, which would need to be 
incorporated into a SOMBR or DISOMER, to reconstitute the 
CAP of the active rVab and regulate the target in the 
desired fashion. This invention identifies the unique 
ability of rVab when used as libraries containing at least 
about 10 10 members to identify those portions of a target's 
surface connected to function in such a manner as to 
immediately provide the tools necessary and sufficient for 
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screening for organic replacements at the target with a 



uses genetic algorithms to construct 3D high resolution 
molecular models of the shapes of organic molecules which 
can fit into the active target and regulate activity so as 
to electronically screen for or synthesize via computer 
programs S0MERS or DiSOMERS . 

Active target landscapes are those surfaces connected 
to target function as defined as those able, when occupied 
by a ligand, of influencing target activity. It is known 
that antibodies, in a wide variety of forms, e.g. Ig, 
Fabj, Fab, or sFv (i.e., VH or VL alone), have exceptional 
selectivity as well as high affinity for their targets . 
This invention uses rVab which are identified as 
possessing the desired CAP attributes in two ways. 
Structural characteristics of multiple rVab's identified 
as possessing the desired CAP attributes are combined to 
produce a composite structural map which is used to define 
a BEEP. In addition, individual rVab's which are 
identified as possessing the desired CAP may be labelled 
so that they may be used as reports in competitive binding 
assays to identify SOMERS, DISOMERS or other ligands 
active at the pharmacological target. 



The approaches to isolation and identification of Vab 
for targets having endogenous ligands and rVab processing 
all TSA+ attributes, are divided based on two fundamental 
issues: first whether the rVab induced target 
modification is allosteric (alloA) or competitive (compA) 
with the native signal (endogenous ligand) and second, 
whether the active surface is a simple or complex 
landscape found one or more different submits of the 
target. Target modification is considered anything which 
alters target activation by any means including native 



desired CAP. In addition, an embodiment of the process 



Identification of rVab's with TSA+ 
For Targets Having End ogenous Ligands 
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signal recognition (i.e., signal binding) and/or the 
signal transduction process directed by the active target. 
For example, the binding of ACh to the muscarinic subtype 
1 receptor and the interaction and activation of the Gi 
protein, respectively. In both cases , the process uses 
5 libraries already selected for, preferably by batch mode 
selection, target recognition i.e., rVabT+, Batch mode 
selection is preferably than used to identify and separate 
rVabT+A+ from those which are inactive under specified 
conditions. Libraries of 10 6 to 10 12 individuals are used 

10 and the process is therefore applicable to rVab libraries 
which have both VH and VL chains, noncovalently (as Fab) 
or covalently attached (as scFv [Hoston, et al . 1988; 
Bird, et al. 1988; McCafferty, et al. 1990; Hoggenboom, et 
al. 1991; Barbas, et al. 1991; Garrard, et al. 1991; 

15 Breitling, et al. 1991] or diabodies [Holliger, Prespero, 
and Winter, 1993] as well as those with only one V chain. 
By methods known to those skilled in the art, individual 
rVabTSA+ within an active rVab A+ library (LIB) can be 
simply and rapidly isolated, assayed, tagged and used to 

20 

screen various chemical libraries for SOMERS which compete 
with rVabA+ for binding to the target. 

For allosteric Vab- modulators, the presence of 
allosteric activity within a rVabT+ library is indicated 
by the occurrence of an alteration in the association 

25 between rVabT+ and the target induced. by the binding to 
the target of another entity. This entity could be the 
native signal or any known target effector entity. 
Examples of allosteric entities include such nucleotides 
as ATP for receptor containing kinases, or GTP for 

30 G-protein associated targets, or a protein which couples 
to the target during signal transduction such as 
G-proteins, or even other receptor subunits. 
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a. Identification of rVabTSA* from 

rVabT+ using allosteric modifiers 

The isolation of rVabTA+ from rVabT+ is tied 
directly to the action of the signal at the target. In 
the preferred process, matrix- linked target (m-Tr)is mixed 
with the rVabT+ and incubated so as to allow m-Tr:rVabT+ 
complexes to form. In general these are the same 
conditions used to isolate rVabT+ in Step I (b) • After 
sufficient time to allow appreciable complex formation, 
which may or may not be sufficient to allow the 
interaction to come to equilibrium, the temperature is 
lowered to about 4°C so as to trap bound rVab in the 
m-Tr:rVab complex by slowing its dissociation rate. With 
the temperature at 4°C, free rVab is rapidly washed away 
and the complex is resuspended in original buffer. This 
process is done quickly and uses a matrix such as, for 
example beads or plastic surfaces, and takes <l min. For 
this process, preferentially one first determines or 
estimates the normal dissociation rate of rVabT+ from the 
target. This may be determined by methods known to those 
skilled in the art. For example, in parallel reactions, 
the dissociation constant (k-j) for target (Tr) and signal 
are determined using either a labeled target (T*) and 
monitoring the dissociation of T*-rVabT + -matrix complexes, 
or unlabeled target and following its release from the 
rVabT*-matrix complexes using anti-rVab constant region 
antibodies (or anti -phage antibodies) or by simply 
assaying phage in the supernatant if a rVab phage library 
is used. The half time (t^) for k-, at 4°C for rVabT* 
library from the target, for the entire population, is 
then determined. 

With the t % for k-, known, a new population of washed 
rVabT+ -matrix complexes of the entire rVab library are 
formed at 4°C and allosteric effectors are added in 
saturating concentrations. Half the population is 
centrifuged to isolate the free rVabT+ members from the 
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library which remain in the supernatant within about the 
first minute (or £l/30th) of the population's dissociation 
t*. The remaining half is allowed to dissociate for about 
10 x t*, centrifuged and the pellet resuspended and 
allowed to dissociate for about another 10 x t (y4 to isolate 
the second population of free rVabT+. In both cases, 
centrifugation is used to rapidly isolate the free rVabT+. 
In the first instance the free rVabT+ library is enriched 
for those rVab members induced to rapidly dissociate, 
referred to as rVabT+A+ allofast, while the second is 
enriched for those which have been induced to dissociate 
slowly, referred to as rVabT+A+ alloslow. Each is 
thoroughly washed and then recycled through the above 
isolation procedure a second time. Such enrichment cycles 
are continued until a clear change in entire populations 
t* for dissociation is seen at which time the population 
is termed rVabT+A+ (fast or slow). Their numbers are then 
determined, if need be after amplification. If these 
populations are small, individual rVabT+A+ (fast or slow) 
can be isolated at this time and assayed directly in 
subsequent procedures. If large populations are obtained, 
they can be analyzed in subsequent steps to isolate 
subpopulations which have other desirable target 
attributes, e.g. specificity (S+) among one of a large 
number of target family members. 

b. Identification of rVabT+A+ from 

rVab Libra irv Using Competition Asgavs 

The second approach to isolating rVab capable of 
target modification is used for the isolation of rVabT+, 

30 whether or not the S properties have yet been determined, 
which are target regulators which bind to targets at the 
same domain or at a domain overlapping with that used by 
the target's natural signal (nS) endogenous ligand. These 
are considered as competitors with nS for binding to the 

35 nS binding domain, and therefore are competitive 
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modulators , not allosteric modulators. Both agonists and 
antagonist replacements for endogenous ligand will be 
found within this population. 

This process requires the use of a high affinity nS 
which is labelled (nS*) and capable of rapid and 
quantitative isolation. There are many such labels 
possible, one is biotin, another, for example, is the 
small antibody epitopes for which high affinity sera (or 
monoclonal antibodies) exists commercially. Methods of 
making such a labelled nS and the available 
epi tope/antibody combination for protein signals and 
organic molecules are known to those skilled in the art. 
Labelling is a relatively easy procedure for protein nS. 
For organic molecules it is much more difficult but in the 
preferred cases where labelling has not yet been done, 
non-neutralizing monoclonal antibodies or biotin will be 
used by methods known to those skilled in the art. 

The preferred process of identification and isolation 
of competitive rVabT* (S determined or undetermined) which 
is outlined here uses biotin as the nS label ("tag"). The 
process works similarly using other labelling tags such as 
iodination with ,25 I, or [ 32 p]ATP phosphorylation. 

The biotinylated high affinity signal *, nS"*, and the 
rVabT+ library to be tested (previously isolated and 
identified as T + ) are combined with a soluble active form 
of the target (Tr) and incubated so as to allow formation 
of significant numbers of nS u, :Tr as well as rVab:Tr 
complexes. The incubation conditions used here are those 
previously used to allow binding of the rVab library to 
m-Tr as long as these conditions also allow nS* 8 binding 
to Tr. The temperature is then lowered to 4°C and all 
nS** and nS^rTr complexes are removed from solution with 
strepavidin (or another tag recognizer coupled to some 
matrix) . The supernatant, containing T:rVabT+ complexes 
and free rVabT* is affinity separated to isolate only 
Tr:rVabT+ by either panning over anti-Tr antibody coated 
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dishes or passed through anti-Tr antibodies coupled to 
agarose. The anti-Tr antibodies used in this step do not 
alter rVabT+ binding to Tr. Such antibodies are known to 
often be those which have epitopes at either the amino or 
carboxy termini of the Tr under study or some other 
non -modulatory (i.e., non-active) target domain. The 
population of rVabT+ bound to Tr in solution and obtained 
by association with anti-Tr antibody on their own matrix 
can be isolated and recycled through the above procedure 
any number of times for enrichment and amplification. 
This population contains all rVabT+ library members which 
bind to Tr at the binding site used by the target's nS. 
This population is therefore made up of rVab which bind to 
the nS binding site and will be labeled rVabT+. Even 
though at this point these active rVabs are 
15 uncharacterized as to agonist or antagonist activity, 

their classification as active rVab is appropriate based 
on the definitions and disclosure of this invention. 

Individual entities within these populations may be 
isolated, tested for agonist or antagonist activity using 
20 standard in vitro , cellular or in vivo assays known to 
those skilled in the art, and/or labeled by procedures 
known to those skilled in the art and used for screening 
for agonist and or antagonist SOMERS. Furthermore , where 
a labelled nS"* exists for Tr, individual rVabT+A+compt 
25 will be tested for competitive modification of nS"* 

binding to T by methods known to those skilled in the art. 
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c. Isolation of rVabT+ Which Are A+ By 
Allostericallv M odifying Targets 

The next process outlines the isolation of rVabT+ 
which allosterically modify Tr (i.e., are A*) by binding 
to sites which do not alter nS binding but do alter the 
ability of the target to be active even for targets devoid 
of native signals. In these cases, active rVab will be 
isolated by virtue of their ability to alter the 
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association of T and some component of the signal 
transduction system used by the target . For G coupled 
receptors, that would be the GTP-G protein complex; for 
targets with catalytic or stoichiometric enzymatic 
activity that would be nonhydrolyzable substrate analogs; 
and for channels or transporters it would be ions, 
molecules transported, electrochemical gradients or other 
channel subunits. In these cases the isolation of this 
type of rVabT+A+ would occur either by a) testing in 
batch mode limited sized libraries i.e., rVabT+A+ for 
agonist or antagonist action in vitro* or b) isolating in 
batch mode those which altered Tr activation, i.e. 
phosphorylation, binding of ATP or GTP, or binding of 
other proteins involved in signal transduction as outlined 
above. Library members which are T+A+ may be diluted and 
retested until single entities are identified. 



10 



d. Identification of rVabT+A+ Pairs 

When Single rVabT+A+ Ar e Not Identical 

If no single allosteric or competitive rVab is found 

20 in cases where an nS exists by one of the above 

approaches, the following procedures are capable of 
identifying pairs of entities which, are both required 
simultaneous as the necessary condition for modification 
of the target. In these procedures, the pairs of entities 

25 tested will be provided by two differentially identifiable 

rVab libraries or preferably one rVab library and another 
large and highly diverse library of identifiable 
molecules. For targets with large protein signals, such 
as growth factors cytokines, etc (i.e., >10,OOOD) which 

3q may be expected to have more than one LIGATT this dual 
modifier assay will be the preferred approach in one of 
two general alternative forms. 

The basic procedure will be described first using two 
differentially labelled rVab library as sources of the two 

35 paired modulatory entities. In addition to the rVab 
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libraries there are both a labelled Tr (Tr*) and a 
labelled high affinity signal (haS) which are also 
recognizable independently and separably from each other 
as well as from the rVab by high affinity probes. In each 
case, recognition of target occurs whether or not these 
5 entities are part of any type of Tr complex but does not 
perturb the target's ability to bind haS* or rVab. For 
example, the labelling epitope contained within the Tr* 
could be one which is recognized by a high affinity Ig at 
sites commonly known to those skilled in the art as 

10 non-neutralizing epitopes. Large protein targets are 
known to encompass such sites within internal peptide 
sequences, N- or C- terminus or unmodified or modified 
amino acids. These epitopes need only be exposed during 
complex formation and non-active, i.e. u n able to modulate 

15 target binding of nS when occupied by recognition antibody 
which can be easily established in each case. 

For signal labels, either biotin or an integral Ig 
epitope, are the preferred label, allowing avidin- or 
Ig- agarose respectively, to be the quantitative recovery 

20 probe as long as the labels do not significantly reduce 
affinity for the target. Other possible labels include 
identifiable peptides or protein sequences", such as 
substance P, partial HSV viral coat protein sequences, and 
enkephalin. The antibodies for such small epitopes or 

25 peptides could be either polyclonal or monoclonal Ig, 
commercially available or rVab as procured by the 
recombinant methods referred to for targets disclosed 
herein. Biotinylation of various signals and testing for 
non-interference with native targfet signal binding to Tr 

30 is available by many methods known to those skilled in the 
art . 

Using an Ig epitope labelled or tagged Tr (Tr*) and a 
biotin- labelled high affinity signal (haS) , the 
identification and isolation of a pair of modulatory 
35 entities (in this example both are rVab) is initiated by 
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combining sufficient numbers of two previously isolated 
large rVabT* populations, each with a specific Ig epitope 
{epitope 1 for rVabl and epitope 2 for rVab2), with the 
haS* 0 * 1 and the epitope tagged Target (Tr*) to allow 
formation of the trimeric rVabl :T:rVab2 complex which does 
not bind haS 1 ***. 

rVabl and rVab2 may be added initially at a variety 
of about equal concentrations from 10 x l 1 down to 10* M. 
The lowest concentration at which target activation occurs 
will be used for subsequent manipulations. The upper 
number is arbitrary but should theoretically exceed by 
about 3 0 fold the concentration needed for rVabl or rVab2 
to bind to Tr so as to saturate the site and prevent 
binding of has*. The mixture is then allowed to incubate 
at room temp for at least approximately 6 hr, or overnight 
and then saturating amounts of avidin- agarose is added and 
the mixture centrifuged and the supernatant, devoid of any 
free has 1 ** or Tr:haSbiot complexes, is removed for 
subsequent use. The supernatant, containing dimers of 
Tr:V AB l and Tr:V AB 2 and the desired trimers of V A l:Tr:V^2 
are then panned over anti-T Ig attached to a solid matrix 
or support such as for example, plastic culture dishes or 
agarose column matrixes. 

Identification and isolation of Tr complexes having 
both rVabl and rVab2 concurrently bound can be made by 
panning successively over matrixes -coated with anti-rVabl 
and then anti-rVab2 Igs. Phage displayed rVabs isolated 
by this procedure can be separated, amplified and then 
used for secondary cycling through the above isolation 
procedure. Finally, individually purified phage are 
tested in identified combinations for competition of 
haStag binding. 

In the above case, the two rVabT libraries (i.e., 
rVabTl, 2 + ) can be easily distinguished for example by 
utilizing the CHI domain of humans on one and the CHI 
domain of mice on the other. Ig specific for human and 
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mouse CHI are available commercially. Use of other 
constant regions from one specie is also possible. 

e. Use Of rVab- Peptide Libraries And 
Other Probes To Identify Multiple 
LIGATT Targets 

i. identification Of First Ligand For 
A Multiplp y.THATT Target 

There are a number of variants to the above 
procedures in which the second entity of the pair needed 
to compete for haS* - * binding would not be another rVab but 
instead would be a member of another library containing 
diverse small organic molecules, peptides, nucleic acids, 
carbohydrates or even natural products. Excluding the 
possibility of stearic hinderance, the frequency in the 
rVab library of entities which bind to a target in a 
modifying manner (given their paired entity is also 
present) should be no different than that for rVab which 
are able on their own to bind to Tr surfaces and modify 
signal binding. Accordingly, rVab libraries of the size 
generated by this invention may be used to identify both 
rVab members of the sought after pair. All of the 
libraries stated above having in excess of 10 u members /ml 
should be suitable for use with this invention provided 
the frequency for each binding event is not less than 
10" 5 . A useful library or pair of libraries should contain 
sufficient members so that two binding events will occur 
simultaneously on the same Tr, the condition necessary for 
inhibition of has* binding, at less than about 10 u and 
therefore be present at least once per reaction. If the 
frequency of each event is greater, i.e., 10 -1 or 10' 3 then 
these modulatory complexes will occur as frequently as 10 
to 100 times per assay. As the purification of an active 
phage displayed rVab per cycle is 10* 2 to 10* 3 then up to 4 
cycles may be needed to purify the active entity. To 
obtain one member of the pair, one only has to purify from 
35 the final step, one of the two rVab entities. When other 
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libraries are in use as the source of the second pair 
member, they need not be isolated at all. 

ii. Identification Of Second Or Subsequent 
Ligands For Secondary LIGATTS Of A 
Fftgtiple frlSATT Target 

Once one member (primary member) of the pair is 
identified, which in the above case would be a rVab the 
isolation of the second is made straightforward by using 
the first member, at saturating concentration in all 
reactions. This simplifies to a search for a single 
entity, which for a rVab, would be done as outlined above. 
However, when one rVab of a pair is in hand, one can 
search through a chemical as well as a rVab library for 
the second member of the pair of Tr binders which regulate 
Tr activity when simultaneously bound to the target. Each 
member of the pair, particularly those which are 
identified as members of a chemical library, are potential 
candidates as one half of a pair of small organic 
molecules, one for each active surface domain required for 
target regulation, which when covalently linked together 
would provide a single active organic molecule referred to 
as a DISOMER. Such DISOMERs would be valid interesting 
drug discovery leads. 

Another protocol for identifying an active pair, 
i.e., a pair which is necessary and sufficient to bind to 
Tr in such a manner as to displace haS* 1 , is to perform 
the original incubation of tagged target (Tr*) , high 
affinity target signal (has) and target binding rVab 
(rVabTrl or Tr2 + ) in the presence of excess labelled Tr* 
to reduce to a minimum the presence of unbound rVabTrl or 
Tr2 + . If these incubations are done in the presence of 
has at about a 100 fold excess of the Tr- saturating dose, 
the only rVab in solution will be those which has been 
competed from binding by has. Accordingly, those rVab 
prevented from binding to Tr by has, should, with high 
probability, be those which can prevent has binding to Tr 
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and are expected to possess the desired activity- As 
bound rVab can be separated from free rVab via panning 
over anti-Tr Ig (or avidin with a biotinylated Tr) , upon 
such removal of all rVab:Tr* complexes, the only rVab 
remaining in solution will be those pairs which when bound 
together, and possibly individually, prevent haS binding. 
Recycling of the supernatant additional times through such 
a paradigm will eventually result in identifying the rVab 
pair or at least one of its members if another type of 
ligand is used as the source of the other half of the 
active pair. 
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iii. Use Of rVab-Peptide As Surface 
Scanners _ 

For signals such as protein hormones and growth 
factors, where dimerization or timerization of identical 
(i.e., homoligomeric) or different (i.e., heteroligomeric) 
receptor units is required for receptor activation. This 
invention solves the problem in one embodiment by creating 
bivalent rVabs which allow for the isolation of bivalent 
active rVab surface reporters capable of identifying each 
receptor subunit endogenous ligand TARGATT attachment 
site. In this process, identification of bifiunctional 
active surface reporters, proceeds by taking a plurality 
of rVabs which have previously been identified as 
recognizing either a particular limited surface of one of 
the target's subunits (i.e. are T + ) , or a larger number of 
one or two selected groups of amino acids which are known 
to be involved with endogenous ligand binding. The genes 
encoding these rVabT + ligands are modified to encode for a 
flexible amino acid which attaches in frame to one end of 
either the heavy or light chain construct, a library of 
small random peptides to create a bi functional scanner 
(rVabPEP) . In one embodiment, the peptide is encoded by 
DNA used to that encoding the heavy or light constant 
domains. In another embodiment an rVab is expressed with 
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at least two peptides for identification of trimeric 
receptors . 

In a preferred embodiment , a bifunctional scanner 
library consisting of rVLCL and one rVHCHl is constructed 
to identify rVab-PEPs which recognize an active surface 
consisting of two TARGATTS on the surface of the target. 
rVab-PEP are then isolated in batch mode and individual 
member are subsequently identified as active competitors 
for endogenous ligand binding. Such rVab-PEPs do not 
significantly bind the target in the presence of excess 
endogenous ligand. These bivalent rVab-PEPs will then 
prebound to target will prevent binding of the target 
endogenous ligand which has been immobilized on a solid 
matrix. 

For homodimeric receptors where each target subunit 
has a TARGATT which binds to the ligand (as per Growth 
Hormone Receptor, GHR) , rVab-PEP would be isolated. The 
rVab portion of a first active rVab-PEP is then labelled 
for use as a reporter to identify S0MER replacements for 
the LIGATT which resides within the rVab portion of the 
active rVab-PEP entity and recognizes one TARGATT on the 
surface of the receptor. To identify a second S0MER 
replacement for the second LIGATT of the rVab-PEP entity, 
which resides in the PEP portion of the rVab-PEP entity, a 
second rVab without peptide is identified from the library 
25 of active rVab-PEP which competes for binding with the 
peptide portion of the first rVab-JPEP. The process of 
finding the two rVab which correspond to the two LIGATT 
residing within an active rVab-PEP entity is referred to 
as rVab Pairing. The second rVab is then labelled for 
30 conversion to a reporter for identification of SOMERS for 
the second LIGATT site. 

Where the targets are heterodimers, the preferred 
approach is as follows. The rVabT* for receptor subunit 
surface I, are grouped based upon recognition of common 
35 domains and/or surfaces containing amino acid known to 
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affect binding of endogenous ligand. These rVab's are 
then expressed as rVab-PEP as described above to generate 
a series of bivalent ligands. Members of this rVab-PEP 
library which are displaced from target by endogenous 
ligand and which also displace endogenous ligand from the 
target are selected as above for homodimer receptors. A 
limited number (< about 10) of rVab-PEPs with endogenous 
ligand displacing activity at the target are then selected 
for identifying a ligand for the second (II) binding site. 
An alternative selection method for identifying site I 
ligands is to select rVab-PEPs based on their ability to 
activate target. Activation may be detected as described 
above based on modification of an allosteric effector or 
on some other detectable change associated with receptor 
activation. For example, activation may be associated 
15 with self phosphorylation or dimerization. rVabs for the 
second TARGATT site on the second receptor subunit of the 
heterodimeric are identified in one embodiment, by 
expressing rVabs as a rVab-PEP library using rVabs 
previously identified as being competitive for the 
20 endogenous ligand at site II. The resulting rVab-PEP 
library for site II is then tested for activity as 
described above and active members are isolated. 
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V. Identification Of rVab Which Are Selective (S*) 

25 m order to isolate those rVab which are selective 

for and distinguish among closely related members of a 
target family or any target of concern (i.e. selective), 
the following batch mode selection procedure may be used. 
The rVabT* under investigation is mixed with matrix 

30 immobilized target (m-T) and allowed to form complexes in 
the presences of soluble peptides, recombinantly obtained 
protein fragments or intact targets whose identical (or 
related) sequences or "conformations are found in targets 
for which the investigator does not wish the rVab to bind. 

35 These sequences are typically between about 6 to 12 amino 
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acids in length and are present in the targets for other 
endogenous ligands of the same gene family. After 
sufficient time for complex formation the rVabT + still 
bound to matrix are isolated by panning and preferably 
recycled 2-3 times for enrichment as noted above to derive 
rVabT*S*. This procedure can be done before or after any 
of the above procedures related to isolating Active (A + ) or 
Target recognition positive (T + ) library members. 

If all screens for T, S, and A are accomplished, the 
final library would be rVabT+A + S + given that there was 
only one LIGATT and one TARGATT required for regulation of 
the target and thereby represent individual entities which 
describe target sites suitable for screening for SOMERS 
with all three attributes of a CAP. Where there are more 
than one LIGATT and one TARGATT required for target 
regulation, i.e., when the target is multimeric or even 
monomeric but contains multiple TARGATT domains, the full 
CAP, including activity (A+) , can only be observed with a 
bivalent rVab, such as would be found in an active rVab- 
PEP. In such cases, the rVab portion of the active 
bivalent rVab would not be active on its own. 
Nevertheless, since it still can identify SOMERS we refer 
to it as A* . 

Clearly, high affinity (less than or equal to about 
30 nM) and selective target recognition do not require the 
antigen pocket of the Vab be made up of two V domains as 
found in native Ig molecules but can exist in single VH 
domains containing only 3 CDRs . Based on the information 
in the art, improvements in making useful single chain 
(rVvx; i.e., vh or vl) with T + , S* and A* properties are 
expected by utilizing constant domains other than CHI, 
i.e., using gamma 2 or 3 or delta. This invention also 
recognizes the need for solubility of the recombinant 
proteins used to construct the members of the rVab, rVvx 
and rVab-PEP libraries. To be acceptable, changes in 
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solubility would not adversely effect VH; VL structure in 
an rVab. 

When using single chain libraries, select the rVvx 
entities which modify pharmacological target activity via 
binding to its surface. Refer to these as the active 
5 rVvxT+A + libraries (LIB) . Isolate actives based on: 

i. those whose binding is modified by the 
presence of the endogenous ligand; 

ii. those whose binding is modified by any 
allosteric regulator of the target 

10 iii. those whose binding alters target (i.e. 

target phosphorylation or association with G 
proteins) . 

In the case of i and ii, actives are isolated as 
soluble entities and in iii precipitated by anti 
15 P0 4 -protein or G-protein antibodies. In i endogenous 
ligand is used ad 300x Kd. In all cases harvest 
positives, amplify, and reisolate. 

Group as to common surface domain recognized by 
rescreening active rVvxT*A*. LIB against target in 
20 presence of small peptides (10-12 amino acids) or large 
peptides made recombinantly (20-50 amino acids) which 
define the target domain. In this assay, those soluble in 
presence of peptide are grouped together, and all data are 
used to construct an antibody surface map. 
25 The members of the rVab library which are 

particularly useful in automated binding assays and 
screens for SOMERS at preidentif ied target sites possess 
preferably the following characteristics. 

a. <30nM affinity for target ; 
30 b. recognized target sites are smaller than 

those used by endogenous ligand signals; 
c. possesses agonist or antagonist activity 
when bound to an active landscape whether 
it be those used by endogenous ligand or 
35 allosteric sites; 




WO 96/04557 



PCT/US95/10182 



- 78 - 



d. specificity for binding to only one among 
many related members of a target family; 

e. little nonspecific binding to unrelated 
targets and substances related to the assay 
itself; 

f . easy and homogeneous and single tagging 
with a label ; 

g. labelling which allows both rapid and 
sensitive quantitation of target binding 
and; 

h. a framework of known structure which 
delineates the location in space of the 
contact points of the reporter with its 
target . 

The latter attribute is critical to the solution of 
the 3D structure of active SOMERS as it allows the problem 
of deducing the 3D- shape of the LIGATT on the target 
surface scanners which are active and in contact with the 
target to be solved after obtaining the one dimensional 
linear amino acid sequence of the reporter with the use of 
genetic algorithms. The 3D landscape of the LIGATT on the 
active rVab is directly transf ormable into a 3D landscape 
of the sought after SOMERS. 

VI. Identification of biologically enhanced ensemble 
pharmacophores ( BEEP) 



A. Combine structural information from identified 

members of library possessing desired attributes 
of potency, activity, selectivity, and 
specificity 

In trying to identify useful rVabs and to deduce the 
structure of the BEEP, the ability to genetically simplify 
(e.g., reduction in number or size) or further diversify 
(e.g., increased number of randomized amino acid 
positions, or increased size) of CDRs and CSRs within 
active rVab libraries or within one rVab is of critical 
significance. This is because not all contact amino acids 
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contribute the same energy to antibody binding and 
sometimes one amino acid cam account for >99% of binding 
energies. Just the 3 CDRs of one VH can provide 10-100 nM 
of Ig target affinity- rVab phage libraries of about 10 12 
members with secondary diversifications in any number of 
5 regions can be derived from a small number of active rVabs 
found initially by processes of the invention previously 
described, by PCR as used to construct the rVab library 
(see below) and or oligonucleotide insertion, known to 
those skilled in the art to provide an acceptably large 

10 enough source of target surface scanners and reporters as 
envisioned by this embodiment of the invention. In 
addition, it is clear that active surface scanner rVab 
will be needed which recognize different local surfaces on 
the target in order to generate sufficiently large amounts 

15 of one dimensional amino acid sequence information so as 

to accurately deduce a BEEP which is not only accurate for 
predicting the structure of one S0MER but is capable of 
predicting the ensemble of active SOMERS which can attach 
to that site. 

20 a particularly novel aspect of this invention is that 

it establishes a way for the CDR regions of a VH or a VL 
alone or complexed together as rVab to be reduced to a 
minimum structure which occupies the target sites 
recognized by the rVab and have a desirable CAP- An 

25 advantage of identifying such a minimum structure is the 

potential reduction of target affinity to a level which is 
compe table in standard binding assays by endogenous ligand 
and potential SOMERS and of the number of critical atoms 
participating in target contact. The smaller the number 

30 of contact points the simpler the resolution of the BEEP. 

B. Create Beeps For Each Active r Vab Subset 



35 



According to this invention, BEEPS are created which 
contain the coordinates and attributes of the active 
elements of the 3D surface of active SOMERS for a 
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particular surface domain on particular pharmacological 
targets. The starting point for this ip grouping together 
of rVabT+S + A + members of the rVab library according to 
common target surface domain recognized which in the first 
instance will be that which is overlapping, or identical 
to endogenous ligand. 

In a preferred embodiment: 

a. Each surface group is partitioned and one 
rVabT + S + A+ for that group is isolated . The VHCH gene is 
then cloned out and used to derive a new combinatorial 
library. To derive this new combinatorial library the 
cloned rVHCHn is paired with all rVLCL for rVab members 
which bind to the common surface. 

b. Isolate via panning (as done for the original 
LIB) all new combinational rVab members (i.e., rVHCJT: 
rVLCL"- rVab) which are T + S + A + for the original common 
target surface domain. This library is called rVab^ 
Repeat for each VHCH in the original rVab thereby deriving 
a rVabvH»+ 1^+2*+... set which identifies all related VH and VL 
for a particular surface domain. These libraries will 
provide multiple combinations of defined VH genes with all 
VL's for a given surface. Alternatively, these various 
libraries may be made by identifying specific VL genes and 
cloning them into libraries containing all VH genes 
identified for a given surface target. 

c. Determine via PCR the amino acid sequence of all 
VL in the set which can bind to all VHs in the library. 

d. repeat a-c for all active V„using [V LJWI . ta . fc+ _. 

e. The spacial coordinates for the framework of the 
parent antibody in which all randomized CDRs were placed, 
along with the coordinates of the various CSR and CDRH3 
for the active VH and VL for those entities found in the 
particular local target surface domain rVab library under 
study along with the amino acids identified in these CSRs 
and CDRs are solved in a genetic algorithm to determine 
the 3D conformation of the pharmacological target 



PCT/US95/10182 

WO 96/04557 



- 81 - 

landscape occupied by all active rVab members which 
recognize the same surface domain. This solution is a 
biological enhanced ensembled pharmacophore (i.e., a BEEP) 

f. Repeat for rVab library for other local active 
target surface domains. 

5 g. If any data base is not sufficient, take the 

relative set of VH genes and excise their CDRH3 domain and 
replace with a random oligonucleotide encoding a peptide 
library of preferably 8 to 10 amino acids. The potential 
size of this library is between about 8 W -10 20 members. 

10 Repeat selections to obtain new diversity enhanced LIB. 

C. Use of genetic algorith ms to create BEEPS 
Creation of the BEEP begins after isolation of a set 
of active rVabs {Vi}i=N, which contain members (Vi) which 
have been verified as having the desired attributes of 
affinity, selectivity and activity at the target, where 
N - the number of such members within the set. In the 
preferred instance, each active rVab will have all three 
of the above attributes, but it is also possible that only 
two, or only one, of the attributes will be desired and 
therefore will be present. For this description, TSA+ 
will refer to the active rVab irrespective of which 
attributes are present. Each TSA+ rVab member is then 
isolated and its amino acid sequence determined using 
procedures known and available to those skilled in the 
art. For example, commercially supplied kits and an 
automated sequencer (ABI, USA) . 

According to this model, it is assumed that an active 
target surface binds different rVabs, through the same 
site of the target surface, and accordingly, at least a 
subset of those rVab are expected to possess similar 
surfaces. Thus, finding a recurring, i.e., common, 
surface motif (which we refer to as the BEEP) in different 
rVabs indicates either: a) the common rVab surface plays 
a role in target : rVab interactions; and b) that this 
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interaction could be duplicated by other molecules with 
similar surfaces. Therein, it is a common surface which 
is responsible for the common phenotype of at least a 
subset of the L, members of the original set of TSA+ 
rVabs. There may be one or more common surfaces within 
the original set of TSA+ rVabs. This duplication takes 
the form of the BEEP first, and subsequently small organic 
molecules. 

Given such a collection of TSA+ rVabs and their amino 
acid sequences, a preliminary set of surface scanners 
{Lj^N, where each Lj is a model of an antibody molecule, 
is constructed according to the invention using the 
canonical structural principals of Chothia (Chothia and 
Lesk 1987, Chothia 1989, and Chothia 1992) and the 
information on the crystalline form of the parental 
antibody used as framework for construction of the rVab 
library as described by this invention, N is the number of 
such TSA+ rVab surface scanners which define the 
fundamental geometry which is the position of surface 
atoms within acceptable distances from each within a 
generally known structure. Shape descriptors rely on 
known CSR and CDRH3 shapes, and the amino acid sequence 
within these domains. Subsequently, chemistry 
characteristics, such as charge, hydrophobic interactions, 
exposed/buried surface area, hydrogen bond formation etc., 
known to those skilled in the art will be considered. 

In the preferred case, each TSA+ rVab contains one VH 
and one VL chain, with 6 complementary determining regions 
(CDR) wherein three (CDRVL1,2, 3) are within VL and three 
(CDRH1,2,3) are within VH. Furthermore, in the preferred 
case, there are the 5, 1 and 6 different canonical 
structures consisting of a different known canonical loop 
structure possible for every CDRVL1,2 and 3 respectively, 
and 3, and 4 different' canonical structures consisting of 
known canonical loop structures possible for every CDRH1 
and 2 according to the invention. The CDR for H3 , 
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although not canonical, in the parental library will have 
one of three defined structures in its parental mode 
before the amino acids positions within each are 
randomized. Furthermore, the prior knowledge of rVab 
framework and relationship of the 6 GDR domains within the 
5 framework provides additional structural information for 
constructing an 1^ and eventually a BEEP. In addition, as 
the number of known antibody structures increases, new 
canonical structures became known and may be incorporated 
into the rVab libraries to allow isolation of TSA+ rVabs 

10 containing such structural loops. 

Each Lj can be represented, for the purposes here, by 
the atomic coordinates of the constituent atoms of the 
rVab which is a member of TSA+ set. The surface (S^ of 
the preliminary model L s can be parsed by its CSRs and 

15 CDRs wherein 

5 i - [ (CSRl) t , (CSR2) s , (CSR3) i , {CSR4) if {CSR5) ± . (CDR6) J 

wherein 1 through 5 denote CSRVL1, 2, and 3 and CSRH1, 2, 
and 6 denotes CDRH3, respectively, and wherein with each 

20 (CSR)j, for I*i there is a particular sequence. 

The surface (S^) can be repositioned and reoriented 
in space by transforming the atomic coordinates of the Li 
according to: S^-G^Lj, where L iF is a model of surface 
scanner i defined by the coordinates of its constituent 

25 atoms and 6^ is a matrix that transforms Lj. Furthermore, 
G% is paramaterized by the translation and rotational 
parameters (%. X\. Yi, Zi>3- Thus, as scanner i is 

rotated and moved into a new position j , and the CDR are 
carried along with it. 

30 T he genetic algorithm of this invention, referred to 

here as D I OGAM, takes the initial set of {L^}, where the 
superscript (°) means % preliminary model' , as input data 
to produce from that data as output the theoretical common 
surface (i*e., the BEEP) which represents the best overlap 

35 
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in terms of chemistry and geometry for members of the set. 

In general, a genetic algorithm (Holland, J.H., 1992 
and Goldberg, D.E. 1989, which are herein incorporated by 
reference) operates on 'genes' to produce variation which 
5 through selection yields * survivors ' . The genes of 

survivors (as judged by % fitness') are then mutated to 
produce newer progeny for further fitness selection. 
Thus, mutated genes, according to the genetic algorithm of 
the invention D I OGAM, are produced and encode altered 
surfaces, which in turn are altered phenotypes. 

The definition of a "gene" for use in the model of 
this invention is a specific sets of values for the 
parameters of G^ {<p % , Xi , Xj, y, z ; )j. Varying these 
parameters changes the position of the surface Sij which 
we define here as the phenotype of the given gene. 

Herein, [ {Gj°}] j«l,M is a population of £4 variations 
of the model L.i, which encompass all possible ways to vary 
the surface of the model, on each member of the TSA+ rVab 
set which gives rise to subsequent models (1st progeny 
generation, 2nd progeny generation, nth progeny generation 
models [1-nJ ) wherein n = the number- of the generation. 

The initial creation of preliminary models follows in 
one embodiment the Computer Vision algorithm for 
structural and surface comparison of proteins (Fisher et 
al.; 1994) using a small number of points, rotational and 
translational in nature for unique definition. This 
method is based on the previous method of the Geometric 
Hashing Paradigm (Lamdan and Wolf son 1988 and Lamdon 
Schwartz and Wolf son, 1990). This' method finds 3D motifs 
within different segments or by isolated single amino 
acids, independently of any linear sequence of amino 
acids. The later provides for incorporation of all 
important amino acids or groups thereof located within the 
5 CSRs and 1 CHDH3 and which by themselves do not occur in 
35 a singularly linear sequence within any rVab. 
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Using only distance invariants, this program obtains 
data from surface superpositioning which is then used to 
solve for portions of the rVib which represent analogous 
portions of surfaces of ligands directly involved in 
ligand- target binding requirements, i.e., the 'docking 
5 problem. ' Various types of surface superpositioning can 
be used, and includes docking of rVabs, one rVab and one 
target, and one rVab and one target related ligand. 
DIOGAM uses an efficient automated computer vision based 
technique for detection of three dimensional structural 

10 motifs (Fisher, D. , et al., 1992; and Bachar, 0., et al. 
1993) . In this process, seed matches are found first, 
based on the Geometric Hashing Paradigm, the clusters of 
seed matches are found using rotational and translation 
parameters to fix 3D motion. Here the seed matches will 

15 be done within specific sized balls, using different pairs 
of balls, the subsequent clustering added by known CSR 
structure and CSR and CDR relationships within each rVab. 
Extensions will be extensive, eventually including all 
amino acids within each CSR said CDR, using reiterate ever 

20 growing cycles. 

Such clustering and extension (referred to here as 
additional level mutations (see below) ) can be used for 
both chemistry and energy analyses. Modeling will 
initially be done individually, then in an aggregate 

25 manner . 

Therein for each progeny generation, the sum of {S/} , 
wherein j=jth member of the ith scanner as appearing in 
the nth generation gives us a Target Fitness Landscape 
(Tj) : which is a set of numbers representing chemical and 
30 geometric properties of the maximally overlapped set of 
S b . For the purposes of this invention, Tn is a vector 
whose components, t j , include but are not limited to 
scaled electrostatic energy, buried surface area, hydrogen 
bonding, and local, curvature. 
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As the algorithms proceeds, it calculates at each 
stage, the target fitness landscape (T) and ascertains a 
mutational strategy for the next stage. Thus, depending 
upon the strategy, all N genes are mutated, producing new 
phenotypes for which a new value of T is calculated. The 
process is complete when T can be maximized no further. 

Thus D I OGAM alters the set of {<p % . Xi , «i, Xj, y, z t ) in 
order to achieve the best overlaps in the general sense 
(geometry, energy and chemistry) and the result is new 
Target Fitness Landscapes (i.e., T) defined to be a 
minimum when maximum generalized overlap has been 
achieved. 

The next or intervening phases of DIOGAM allow 
variation (i.e., mutation) in the Li themselves thus the 
genetic algorithm include s genetic varation of CSRs and 
CDRs. For DIOGAM, the mutated gene (i.e., the augmented 
or varied gene) is the collection of rotamer angles of the 
side chains themselves within the CSRs and CDRs. Such 
changes would include, as example, changing the rotation 
around a Cor-QE? bond (C=carbon) , which for a valine put it 
in result in 3 different positions) . For an arginine, 
there are up to 27 rotomers of the guanidium group. In 
the preferred mode, structural variations will be carried 
out early on. Considering mutational events, another 
level of variation could be rocking of the_ models. 
Further mutation (i.e., variation) .would be changes in the 
angle between VH and VL from 0-15 degrees, which has the 
effect of shifting the target residues within the genes 
over a longer distance which can be considered shifting C 
of positions. These mutations will include 'catastrophic 
events' having global implications for the position of the 
amino acid within the CSR or CDRH3 . These mutations 
enable local minima trapping to be avoided. Although the 
above mutational events are the first two preferred, the 
order of changes will be modified during the overall 
DIOGAM program. 
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° Note that VH CDRH3 is a special case. This is so 

because first there are no canonical structures for CDRH3, 
second, it is by far the largest CDR region with insertion 
sizes of up to close to 24 amino acids; and third, because 
it can influence the angle between VH and VL. Therefore, 
5 this region is the one of most variations with the least 
structural restrictions . 

According to the preferred mode of the invention, 
there are two positions within each CSR gene, which do not 
alter its canonical structure, and which are randomized in 

10 the rVab lib. as to amino acid. This translates to the 

possibility of any one of 20 amino acids being present at 
these two positions within each CSR and CDHR3 within any 
one of the Li members selected TSA+ rVab set under 
analysis. Therein, in the first level variation phase of 

15 DIOGAM, there is an arbitrary 'mutation', herein meaning 

rotation, of the gene allowing presentation of the various 
possible retainers for these two particular amino acids 
found within one TSA+ rVab at each of the two randomized 
positions within the gene. Such mutation events will also 

20 be used later with VH CDRH3 at its two randomized amino 
acid position - 

These mutants will then be analyzed by DIOGAM to 
derive other sets of T^ in the manner described above. 

Additional mutational events may also be utilized to 

25 produce further diversity to more fully describe the 
minimum structural requirements to define the common 
overlap (i.e., BEEP) which has the best TSA+ phenotype for 
the active site of the Target. Mutational events which 
effect fitness, will involve, but not be restricted to 

30 hydrophobic, electrostatic and conformational entropy 

effects, surface roughness, surface curvature, avoidance 
of unpaired charges, favorable and unfavorable steric 
interaction of functional groups and will be characterized 
by available programs like COGEN (Bruccoleri, R.E., and 

35 Karplus, M. , 1987; Novotny, J., Bruccoleri, and R.E. 
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Saul, F.A. , 1989; and Tulip, W.R., et al . 1994) and the 
multiple copy simultaneous search method of CHARMM 
(MiranJcer, A., and Karplus, M. , 1991; Patai, S. 1989 and 
Brooks, B.R., et al., 1993) using functionality 
descriptors with fewer atoms (Andrews, P.R., Craik, D.J., 
and Martin, J.L., 1984) or a spherical approximation to a 
multi-atom group (Goodford, P.J. , 1985 and Goodsell, D.S., 
and Olson, A. J. 1990) based on time dependent Tartree 
approximation or minimization (Elber, R. , and Karplus, M. 
1990) . 

Once these mutational levels (l°-n° level mutations) 
have been gone through one time, for each hf, there will 
be new children (perhaps hundreds to thousands) of the 
original parental rVabs. Structural parameters of the 
second are then put through the 'Nussinov- Computer Vision' 
algorithm (Fisher, et. al. 1994), which is included herein 
by reference, to obtain the best alignment. Details of 
this method and some applications of the program {Fisher, 
D., et al., 1992 and Bachar, O. et al . 1993) are included 
herein by reference. The lowest values of the target 
functions for each Tn, will be different. The values will 
include, but not be restricted to, rms (for geometric 
overlap) , AG (Gibbs free energy) and chemistry. The 
mutational events will produce progeny which will be 
selected as having <rms, <energy and <negative chemistry 
values than those of the parental targets. Together the 
sum of these values define an overall Target Fitness 
Landscape for each Tn. 

At this stage, DIOGAM will use commercially available 
algorithms, as described (see Goldberg 1990) by providers, 
and known to those skilled in the art, to score and 
register the results of each fitness test. At this stage 
then, there will be a list of <*,x».«^Xi,Yi,Zi for each 
and a running fitness score . (Ti j*) . DIOGAM then goes back 
to next cycle of genetic variations, doing these 
iterations for thousands and thousands of generations. 
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simultaneous, or in an ordered fashion, which at its 
termination will provide a list of best minima, which will 
be the 1st level BEEP, i.e., the best overlap of the 
surfaces contained within the set of active TSA+rVab. 

We have done this manually in the case of two 
antibodies (NC10 and NC41) to the same site (epitope) on 
the surface of neuraminidase (Tulip, W.L., et al., 1994) 
and Malby, R.L., et al., 1994) which have been defined 
crystallographically and which provides us with a 
population, here only containing two members, which 
approximates the TSA+ rVab population isolated by this 
invention. Analysis of this population has shown overlap 
of antibody CSR and CDR surfaces which are bound to the 
same epitope. Therefore, a Sij surface as envisioned by 
this invention can be made. 

At this stage, DIOGAM now goes back to the mutation 
stage and iterates, i.e., arbitrary changes rotamer 
position, overlapping the set, yet in so doing producing a 
slightly different set of w Xi* Yn z i* but more 

importantly, finding Ts which are different (higher or 
lower) from its predecessors. Thus every character of 
every gene will be updated to reflect the fact that it 
incrementally (differently) contributed to a more robust 
phenotype (target fitness landscape) . 

DIOGAM directs the algorithm to enter into its next 
stage, initiated after many such mutational iterations, 
its crossover or recombination stage, wherein it creates 
new combinations of genes, even without knowing what is 
good (better fitness) about an existing gene mutations. 
These combinations, i.e., mating, of genotypes (or 
isogenotypes) are based on T scores, equal phenotype 
selection of better fitness, wherein fitness is defined as 
contributing to maximal overall overlap. 

It is noted here that overlap is not restricted to 
physical occupation of identical space, but includes 
overlap defined, for example, as charge neutralization 
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wherein, for example, two negative charged residues may be 
scored as % overlapping' if they each could be within some 
o distance of a positive charge. 

In this entire process, it is important that the test 
tube selection of TSA+ rVab from the large rVab libraries, 
5 selects the right combination of genes which presently in 
no way can be guessed in advance. By definition, the 
combination existing in the active TSA+ rVab is % correct' 
as it contains the surface necessary for desired activity 
profile, i.e., consisting of one or more of the desired 
10 attributes of affinity, selectivity and or activity on the 
target . 

To summarize, in our genetic algorithm, D I OGAM, the 
gene is the object, the mutation is the change and the 
early selection is the testing by iteration to get a 

15 better number of individual genes. This is then followed 
by crossover using genetic logic of pieces of genes which 
are responsible for the fitness. This crossing over and 
recombination in the preferred instance includes deletions 
and additions of single amino acids or groups (referred to 

20 a seed clustering, or extension or simplification) . With 
regard to additions, this includes those amino acids 
within the CSRs, CDR and framework domains of the rVab 
which have not been randomized, and includes those within 
the CSRs which are critical to the canonical loop 

25 structure itself. The importance of deletions and 

additions to genes as later mutational events is important 
as published data (Malby et al . 1994) shows that for two 
antibodies binding to the same antigen epitope, one of the 
CSR in the pair does not make contact with the target 
30 surface and that large target recognition domains may 
themselves contain much smaller domains which are 
responsible for the most of the energy of target 
interaction (Clackson and Wells, 1995). For the purpose 
of this invention, the Ti of the best common overlap, 
35 i.e., the BEEP, is related to the existence of a small 
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subset of high energy density points in the atoms target 
surface (Clackson, T. and Wells, J. A. 1995; and Tulip, 
W.R., et al., 1994), which is considerable less than all 
contact residues. This is expected to simplify the 
alignment (i.e., overlapping) of the Lj for example if the 
target domain which is responsible for the TSA+ phenotype 
of the set selected rVabs is assumed to have just two hot 
spots then there is a very restricted number of ways a 
given antibody, known to interact with the site so as to 
have a TSA+ phenotype, can bind to that site. 



10 



D. Identify small organic molecules active at' 
target sites _ 

1. Use of BEEP as high volume screening 
reagent — 

15 The BEEP provided by this invention may be used as 

follows to identify SOMERS or drug leads. 

a. Use BEEP to electronically screen CHEMFILE to 
identify SOMERS as discovery leads using computer 
structural programs commercially available and known to 

20 those skilled the art. 

b. Use the coordinates of the BEEP to screen via 
existing computer technology entire chemical data bases 
for matching SOMERS. 

c. Select a few SOMERS and test in vitro and in 
25 vivo to confirm discovery lead. 

d. Use BEEP to direct synthesis of active SOMERS 
via techniques known to those skilled in the art of 
medicinal synthetic chemistry. 

2. Identification of SOMERS using rVAB- 
30 Reporters 

a. Select 1-2 representatives of each surface 
domain group within the active- selective rVabTSA* library 
and enzymatically label with, for example a radionuclide. 
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b. Establish competition binding assays using 
endogenous ligand and known alios teric target regulators 
as displacer labelled rVabTSA reporter. 

c. Screen chemical libraries via standard automated 
binding assays for SOMERs which displace labelled rVab 
from its target. Identify all close analogs of active 
SOMERS and perform SAR for target binding. 



identified as follows (See Figs. 21 and 22): 

a. Start with all rVab which recognize a surface on 
pharmacological targets. These can be selected following 
steps described above. 

b. Modify the phage rVab, rVvx library to contain 
one or two large random peptide libraries sufficient to 
occupy the other one or two TARGATTS which together make 
up the active surface of the target. After identifying a 
scanner rVab to identify one TARGATT identification of the 
others is accomplished which may also be done in the 
presence of the first discovered SOMER. Do limited SAR on 
each SOMER to identify the inactive elements, covalently 
oligomer! ze the two or three SOMERS via linkage through 
their inactive surfaces to make a DISOMER or TRISOMER. 
Test in vivo and in vitro to identify best Discovery Lead. 

c. Test most potent SOMERs for activity using an in 
vitro target assay. 

d. Test in vitro active SOMERS with best CAP in 
vivo (via I. P. route to identify Discovery Leads. 

If no analogs exists of originally discovered SOMERS, 
carry out limited synthetic effort, use A*rVabs or rVvx to 
do a limited SAR binding study and then select best and 
test in vitro and in vivo for entire CAP. 

If label reporter A*rVab or rVvx for a particular 
target domain does not uncover SOMERS or none are 
displayed by endogenous ligand, perform secondary 
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In a preferred embodiment, DISOMERS are 
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simplication or diversification of CSRs and CDRs, reselect 
for the TSA* and carry out 3A again. 

Screening for small organic molecular replacements 
(SOMERS) will be done by methods known to those skilled in 
the art using robotic assay employing labelled n[*]rVab 
with specific CAP and searching for compounds which 
displace [*]rVabT* binding to targets. 

e. Excise all rVHCH domains from rVHCHT* . LIB, move 
into the plasmid for bacterial periplasmic expression and 
create a library of soluble VHCHT + . Mix this library of 
soluble rVHCHT* entities and a phage library of rVLCL 
displayed attached to the phage coat protein through its 
CL region (rVLCL. LIB) to make a combinatorial library 
wherein only one member is packaged in the isolated phage 
and pan against target protein as in 2Aa. After 
15 enrichment (2-4 cycles of selection for one or more of the 
three desired properties) the genes for the active rVLCL 
entities are obtained. The genes for the active rVHCHT* 
entities may then be obtained in a manner similar to that 
used to obtain the rVLVL genes. After exision of both the 
rVLCH and rVHCH genes, the Cre-Lox recombination system 
(see below) may be used to construct a single phage 
containing both chains and for expression of the rVab.LIB 
as a phage displayed functional complex. In another 
embodiment, the libraries may expressed as single chain 
25 versions with VH and VL coupled through a linker using 

commercially available kits, such as those from Cambridge 
according to the manufacturer. Finally, enrichment and 
selection of VH:VL combinations which possess the desired 
target attributes may be obtained by, for example, 
panning . 
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EXAMPLE 1 

Construction of a Recombinant Surface Scanner 
rVab Library (rVab.libl. 
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VII. Selection of Parental Fabs of known crystalline 
Structure as rVab libra ry framework templates 

The amino acid sequences and crystalline structure of 
the light and heavy chains of the antibody ABXXX which is 
used as the parental Fab for construction of the 
rVab. library are obtained from the Brookhaven Data Base, 
the Kabat Data Base, GBNEBANK (email: NCBI.NIH.GOV.) or 
Kabat, E.A., et. al . (Kabat, T.T.Wu et al . 1991). The V 
regions of the light and heavy chains are subdivided in 
domains as follows: the highly variable complementary 
determining regions (CDR) , the canonical structure region 
(CSR) within each CDR, and the intervening framework 
regions (FWR) (Fig. 2 .5.6) . Individual amino acids not 
within a CSR or CDR, but nevertheless essential to the 
canonical structure (Chothia and Lesk 1987; Chothia, Lesk 
et al. 1989; Kabat, T.T.Wu et al. 1991; Chothia, Lesk et 
al. 1992) are also listed (Fig. 5, 6) 

ABxxx is selected as the parental framework template 
for the construction of the ABxxx rVab.lib for recognition 
of target surfaces by an antibody with a planer type 
antigen combining site. This selection is based on the 
following: 1) availability of the crystal structure of 
the antibody (bound or free of corresponding binding 
partner, i.e. antigen); 2) the antibody is a member of the 
planer type combining site group of Fabs (Webster, Henry 
et al. 1994) which have been found to recognize protein 
surfaces; 3) the antibody has canonical structures for CSR 
Hi -2 and LI- 3; 4) the antibody's CDRH3 size is in the 
mid -range of sizes of CRDH3 (so as to favor equal usage of 
all 6 CDRs of the rVab in target recognition (Wu, Johnson 
et al. 1993); and 5) the antibody's antigen is a protein 
(Fig. 3). Parental antibody frameworks found in antibodies 
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° with a cavity and a grove group type combining site 

(classification as reviewed by Webster [Webster, Henry et 
al. 1994]) will also be used to make two additional rVab 
libraries in a fashion similar to that described below for 
the rVab.lib based on ABXXX. Together these three 
5 libraries generate a sufficiently large number of probes 
for surface recognition of relevant binding sites. 

In the ABxxx rVab.lib the natural diversification of 
antibodies is provided by placing within the library 
varied combinations of VH and VI> domains which themselves 
*0 have varied combinations of the known canonical CSRs, 

variable length CDRH3s, and randomized amino acids (one of 
20 essential amino acids) at one or more amino acid 
positions within the CSR or CDRs of each V region within 
each rVab (Fig ,4) . 
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VIII. Creating the Nucleic Acids Encoding the Heavy 
and Light Chains (rVHCHl and rVLCL) for ABXXX 
rVfrfr t Ub, 

The nucleotide sequence of ABxxx is obtained from 
Sequences of Proteins o f -Immunological Interest. 5th ed, 
(Rabat, E.A., T.T.Wu et al . 1991); the Rabat Data Base 
(NCBI.NIH. GOV) ; or GENBANR . Identification and analysis 
of all restriction sites present within these sequences 
may be accomplished using a commercially available program 
(GCG [Univ. Wisconsin, USA] , MacVector [IBI, Rodak,New 
Haven, CT] , DNAStrider <C. Marck, Gif -Sur-Yvette Codex, 
France, Service de Biochemie, Inst". Res. Fundamental , 
Aloric Energy Commission of France) and SeqEd, [Applied 
Biosystem] ) . 

Restriction sites endogenous to ABxxx and conflicting 
with construction of the rVab.lib as outlined below are 
removed and replaced with other nucleotides not encoding 
the conflicting restriction site. This is done using 
sequences which keep unchanged the identity of the 
parental amino acid(s). 
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The sequences are then analyzed again for the changes 
necessary to place the convenient and unique restriction 
sites throughout the V and C genes needed for library 
construction as outlined below. 

The ABXXX rVab.lib is built according to this 
invention from separate rVLCL (Fig. 7) and rVHCHl (Pig. 8) 
chains which are combined randomly in an in vivo process 
(Fig. 14). The construction of the rVLCL and rVHCH nucleic 
acid libraries encoding the rVLCL and rVHCHl chains, is 
accomplished in steps outlined as follows: step 1) 
oligonucleotide synthesis: construction of a) amino 
terminus end <5'V) , b) a midregion (MIDV) for VL only, 
and c) a carboxy- terminus end (3'V) of the V region; step 
2) diversification via PCR of some CSRs; step 3) ligation 
of the sections; step 4) diversification of the remaining 
CSRs ; and step 5) ligation of the appropriate constant 
(CHI or CL) region derived by PCR or oligonucleotide 
construction to generate the complete recombinant heavy 
and light chain libraries (rVHCHl. lib and rVLCL.lib). 

5 te P Construction of rVLCL.lib (Fia. 7) 

In the oligonucleotide phase (step A, Fig. 7), 
construction of a) the 5' (5'VL) end; b) the Mid 
section (MIDVL) and c) 3' (3'VL) end of the VL region 
uses eight synthetic oligonucleotides comprising four 
complementary pairs. Each oligonucleotide (x) has a 
25 complementary mate labelled x' . Two oligonucleotide 

pairs, a/a' and b/b' are used to make the 5' end. The 
MIDVL (c/C), and the 3'VL <d/d' ) sections are each 
synthesized from one oligonucleotide pair. The amino acid 
and nucleic acid positions encoded by the specific 
oligonucleotides are shown in Fig. 7. 

The variance in amino acids at position 2 (within 
a/a') and 71 (appended to c/C) necessary to allow for 
construction of all the desired VLl CSRs is added during 
later steps as described below. All oligonucleotides are 
synthesized so as to have at least one overlapping 
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complementary sticky end, an absence of hairpin forming 
ends, and to be noncomplementary to sequences other than 
that of the desired oligonucleotide joining partner based 



primer analysis software program. 
5 fltgp Ka> i Construction of 5'VL Section 

For construction of the 5'VL end section in step 
1(a), the oligonucleotides are first phosphorylated, then 
mixed together in one reaction mixture, heated, annealed 
and ligated together using generally known molecular 

10 biology technology (Sambrook, Fritsch et al. 1990). The 
product is then isolated and ligated in 60 pi reactions 
with 1200U T4 DNA ligase (New England BioLabs) to 5 fig 
pCLONALL (see Fig. 9 which lists all general use plasmids) 
digested at restriction site (rs) prsO and rs4 ( B p tt 

15 signifies that the location of the restriction site is 
within the plasmid and outside of the rVab sequence) 
(Sambrook, Fritsch et al. 1990). 

DNA is purified from the ligation mixture using 
Gleneclean II (BiolOl) , resuspended in water and used for 

20 transfection by electroporation (Dower, Miller et al. 
1988) of E. coli TGI (Gibson 1984) grown in broth 
containing 1% glucose for Ih and then plated on dishes in 
antibiotic containing media. After overnight (o.n.) 
incubation at 37 *C, individual colonies are picked. 

25 Colonies are identified as rVL3-24 .bact first by 

diagnostic PCR using primers pCFWD and pCBCK (see Primer 
Table, Fig. 10) and subsequently confirmed by sequence 
analysis via automated an ABI sequencer and commercially 
available related kits as outlined by manufacturer 

30 (ABI, USA) . Storage of positive clones at -70*C is done in 

broth (Miller, 1972) containing 15% (v/v) glycerol. 
Step 2: Diversifi cation Bv PCR 

Toothpicked frozen glycerol stocks of rVL3-24 are 
used in PCR reactions to append primers conferring 

35 diversification to the rVL section. One of the five 



on analysis by a commercially available oligonucleotide 
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different CSRLl diversified with random amino acids at two 
positions is used as the FWD primer at the 3' end of the 
parental ab/a'b' 5 'VI* section. The BCK primer for the 5' 
end comprises nucleic acids encoding one of the three 
different amino acids I,V or S at position VL2, and the 
5 amino acid of the parental ABXXX at position VL1. These 
appendings are done in 5 primary PCR reactions, each 
containing one FWD primer (i.e., L1.1FWD, L1.2FWD, 
L1.3FWD, L1.4FWD or L1.5FWD) and one of three different 
BCK primers in the following combinations: L1.1-3BCK 
primer mixed with the 3 reactions containing Ll.l, L.12 
and L.13 FWD primers, and L1.4BCk and Ll.SBCk mixed 
correspondingly with one of the two remaining LI - FWD 
primers. Subsequently, amino acids VL34-44 are appended 
to the primary PCR products in secondary PCR reactions by 
taking an aliquot of the primary reaction and carrying out 
secondary PCR with primers L1ALLFWD and L1ALLBCK. The 
products of the secondary reactions are kept separate and 
are labelled rVLl-44CSRl.l-5.lib.pcr. These constructs 
allow subsequent generation of all 5 known canonical CSR 
LI in the rVL.lib after cloning when these products are 
joined with the appropriate MIDVL section having one of 
three different amino acids in position VL71. Bach of the 
primary PCR uses Taq polymerase, FWD and BCK primers as 
noted above, in 50 pi reaction mixtures and is cycled 25 
25 times (94*C for 1 min, 60*C for 1 min and 72*C for 1 min) . 
The secondary PCR reactions (25 pi) use fresh Taq 
polymerase and 1 pi of amplified appended diversified 
primary PCR reaction mixture product, FWD and BCK primer 
pairs as noted, and the reaction is cycled 30 times (94 -C 
30 for 1 min, 55 -C for 1 min and 72 'C for 2 min) . A list of 
the sequences of all primers appears in Primer Table 
(Fig. 10) . 

In step C, the five products of the secondary 
amplif ication reaction of correct size, are designated 
35 rVLl-44CSRl. 1-5, and are isolated on low percentage 
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acrylamide gels, recovered, restricted and ligated to 
pCLONALL precut with prs4 and rs2 and cloned via 
electroporation (Dower, Miller et al. 1988) into P, CQli 
as described (step B, Pig. 7). These five 5'VL section 
products are designated rVLl-44CSRl.l-5.lib.bact . Twenty 
5 v clones of each library are checked first by diagnostic PCR 
and subsequently five (5) clones are analyzed for 
diversification of CSRl by automated sequencing as 
described above using pCFWD and pCBCK sequencing primers 
and commercially available kits < ABI, USA) . This procedure 
10 generates greater than 10 4 transf ormants per each of the 
five VL1 CSRs. 

Step Kb) : Construction of the MIDVL section 

In parallel fashion, a second set of reaction steps 
A-C constructs the MIDVL section of rVLlib. The MIDVL 

15 section originally contains amino acids rVL53-68. The 
oligonucleotides for this reaction are contained in the 
one pair c/C . 

In step A, each oligonucleotide is phosphorylated, 
the pair hybridized together under annealing conditions, 

20 an d the c/C double stranded DNA complex is purified and 

ligated in a 60 pi volume with 1200U of T4 DNA ligase (New 
England BioLab) to approximately 5 pg rs2 and prs5 cut 
pCLONALL (Sambrook, Pritsch et al. 1990). Ligated product 
is isolated from the mixture using Genecleanb II (Biol 01) , 

25 resuspended in water and used to transform E T coli via 

electroporation (Dower, Miller et al. 1988). After 1 hr 
in broth containing 1% glucose, the cells are placed on 
dishes in antibiotic containing media- After overnight 
incubation at 37 *C, individual colonies are picked and the 

30 MIDVL section transf ormants are identified from among 3 0 
transformants generated by diagnostic PCR. Confirmation 
of sequences is by automated sequencing using an ABI 
automated sequencer using pCFWD and pCBCK primers 
( ABI, USA) . Positives are labelled rVL53-68 .bact . and 

35 frozen glycerol stocks are produced. 
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In step B diversification, PCR is used to append 
diversified CSRL2 to the 5' end of MIDVL. Three different 
amino acids at VL71 (i.e., Y, F and A) followed by 
restriction site rsC between VL72 and VL76 followed by a 
rs4 restriction site are appended with primers to the 3' 
end of MIDVL. These additions are done in three separate 
reaction mixtures, one each containing FWD primer 
L2.71YFWD, L2.71FWD and L2.71FWD. All three FWD primers 
contain the rsC site which will allow joining of MIDVL to 
5'VL sections. For each of these reactions, the BCK 
primer is L2ALLBCK which contains an rsB site as well as 
DCSRL2 diversified at amino acid VL50 and 51. Each 
mixture contains a toothpicked frozed glycerol stock of 
rVl.53-68 (see Primer Table, Fig. 10) , Taq polymerase, in 50 
Ml mixtures, and is cycled 25 times (94 'C 1 min, 60 "C 1 
15 min 72 *C 2 min) . 

In the following step C, approximately 1 /xg of the 
amplified diversified appended MIDVL products are isolated 
using Magic PCR Preps ( Pr omega ) , cut with prsl and rs4, 
reisolated and ligated to 5 ^9 pdjONALL precut with prsl 
and rs4 in 60 >il volume with 1200U T4 DNA ligase (New 
England Biolabs) (Sambrook, Fritsch et al. 1990) . The 
ligated plasmid DNA products are isolated using Geneclean 
II (BiolOl) , resuspended in water and used to 
electroporate E. coli to generate, as noted above, a 
25 library of transf ormants (Dower, Miller et al. 1988). 

The three separate groups of successful transf ormants (one 
for each type of VL71) are identified by diagnostic PCR 
and confirmed regarding diversification of VLCSR2 by 
automated sequencing of 10 clones of each group. These 
30 transf ormants are designated 

rVL38-73CSR2:71 ( Y, F , A) lib . bact . This procedure gives >10 4 
transformations for each group. 

Step it (C) ; Construction of the 3'VL section of rVT. 

In the third .set of parallel steps A-C, the 3'VL 
35 section of rVL.lib is constructed. This section is 
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8 originally built to contain amino acids VL72-90 and uses 

the one oligonucleotides pair d/d' . In step A, this pair 
is phosphorylated and the two oligonucleotides annealed. 
The double stranded complex is then isolated and is 
ligated to pCLONALL precut with prsO and rs4' . Ligated 
5 product is isolated and used to transform E. coli via 
electroporation (Dower, Miller et al. 1988) as above. 
3'VL section transf ormants are isolated from among the 
transf ormants generated, and diagnostic PCR is preformed 
on twenty of them, the positives being confirmed by 

*0 automated sequencing and labelled rVL76-90 .bact . Frozen 
glycerol stocks are prepared. 

In the next phase, diversification (step B) , the six 
diversified CSRL3s, followed by a new prs5 site, as well 
as amino acids VL72-75 which contain the convenient 

15 restriction site (rsC) , are appended to VL76-90 to make 
the following 5'VL PCR product: rVL72-100CSR3 . 1-6. per . 
Diversification of CSR3.1-6 occurs at positions VL92 and 
93. These processes are done in six (6) separate 50 ftl 
PCR reactions each containing one L3.1-6FWD primer, all 

20 containing L3ALLBCK (see Primer Table, Fig. 10) , and Taq 

polymerase in 50 fil mixtures. The reactions are cycled 25 
times (94°C 1 min, 60°C 1 min and 72°C 2 min) . 

In step C, the amplified diversified appended 
products are isolated using Magic PCR Preps (Promega) , cut 
25 with prs2 and rs5, reisolated and ligated into pCLONALL 
precut with prsl and prsS. The ligated plasmid DNA 
products are isolated and used to electroporate E. coli to 
generate a library of transf ormants as noted above and 
designated rVL72-l00CSR3 . 1-6 .lib. bact . This procedure 
30 gives greater than 10 4 transformations which are 

identified by diagnostic PCR and sequencing to contain 
appropriately randomized amino acids at the diversified 
positions within VLCD3 for each of the six (6) VLCSR3S. 

35 
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Step 3: Ligation 

In step 3, the 5'VL and MIDVL sections are joined 
(see Fig. 7> . Five fig of DNA of each of the five 
rVLl-44.1ibs (i.e., CSR1.1-5) is digested with rsB and rs5 
and ligated to 1 fig of insert isolated from the three 
rVL38-70CDRL2:71* using 1200U T4 DNA ligase ((New England 
BioLabs) (Sambrook, Fritsch et al . 1990). In these 
reactions, ligation pairing of 5 ' VL [rVLl-44CSRs) to 
MIDVL [rVL3 8-76CSR2: 71*] is maintained as; 5'VLl.l-3 x 
MIDVL2:71Y; 5'VL1.4 x MIDVL2:71F and 5'VLl.5 x MIDVL2:71A 
to create the five rVLl - 76CSRD1 &2 . DNAs . Each of these is 
used to electroporate E. coli (Dower , Miller et al. 
1988) . 

The bacteria are then grown in broth containing 1% 
glucose for 1 h and are plated on dishes in antibiotic 
containing media. After overnight incubation at 37°C, 
individual colonies are picked and are characterized first 
by diagnostic PCR and then by automated sequencing. Some 
100 colonies are analyzed by diagnostic PCR and 20-30 by 
sequencing to confirm the random presence of different CSR 
pairing and diversified amino acids within the various 
CSRs. Frozen stocks of the five groups are then prepared 
and are designated rVLl- 76CSR12 . lib.bact . 

In step F, the extended 5'VL halves, consisting of 
the five rVLl-76CSRl&2.1ibs., are joined in 30 separate 
PCR reactions in combinatorial fashion with the six 3'VL 
halve sections, consisting of the six (6) 
rVL72-100CSR3.l-6.lib. This process generates 30 full 
length rVLl-100CSRl&2&3 .lib. (as diagramed in Fig. 7). In 
each of these library constructions, about 5 ^g of DNA of 
each of the five rVLl- 71CSR1&2 .libs (i.e., CSRl.l-5) is 
digested with rsC and prsS and ligated to 1 /*g of each of 
the inserts isolated from the six rVL72 - 100CSR3 . 1 - 6 
digested with rsC and prs5 using 1200U T4 DNA ligase (New 
England BioLabs) (Sambrook, Fritsch et al . 1990) to create 
35 the 30 rVLl- 100CSRD1&2&3 .dna preparations. Equal aliqouts 
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from each ligation mixture are pooled and the pooled DNA 
is purified using Geneclean II (BiolOl) and resuspended in 
30 /il water to create the completed rVLCL . lib . dna . PCR is 
then used to append to the 3' end of this DNA library, the 
nucleotides encoding the remaining amino acids of VL (i.e. 
rVLlOl-107) , amino acids at the 5 'end of CL (i.e., amino 
acids CL 108-110) , and within this sequence the convenient 
rs3 site. The rs3 site, also designated the rsCLLNK site 
(Fig. 9), subsequently allows the joining of rVL.lib with 
its cloned rCL section. 

These appending reactions are done by carrying out a 
PCR reaction with an aliquot of the purified 
rVL.1- 100CSR1&2&3 . lib.dna, the primers LtJCLLNKFWK and 
L1ALLBCK, and the Taq polymerase in 50 /xl volume mixtures 
cycles. The PCR reaction is cycled 25 times (94°C for 1 
min, 60°C 1 min and 72°C for 2 min) . 

Amplified DNA is then purified using Magic PCR Preps 
(Promega) . After suspension in water, 1 fig of the 
purified DNA is digested with rs2 and prs5 and ligated to 
5 fig of pCLONALL DNA precut with rs2 and prs5 using 1200U 
T4 ligase (Sambrook, Fritsch et al . 1990) and used to 
electroporate E. coli (Dower, Miller et al . 1988). The 
bacteria grown in broth containing l% glucose for 1 h are 
then plated on dishes in antibiotic containing media. 
After overnight incubation at 37°C, individual colonies 
are picked and characterized first .by diagnostic PCR and 
then by automated sequencing. Some 100 colonies are 
examined by diagnostic PCR and some (about 5-10) by 
sequencing to confirm the presence of amino acids VLl-llO 
and the random presence of different CSR pairings and 
diversification of amino acids within the various CSRs. 
More than 10 s transf ormants are generated in this process 
and a frozen stock of the library is then prepared and 
designated rVL.lib.bact . 

In the last step (step G) of rVL.lib construction, 
DNA from rVLlib is digested with prsl and rsJCLNK, and 1 
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/ig is ligated to 5 fig of pVLACCEPTOR (Fig. 9) , precut with 
prsl and rsJCLLNK, using 1200U T4 ligase (Sambrook, 
Fritsch et al. 1990). The product is then purified from 
the ligation mixture using Gleneclean II (BiolOl) and 
resuspended in water. This material is used to 
electroporate E . coli (Dower, Miller et al. 1988), and 
the bacteria are grown, after l hr in broth supplemented 
with 1% glucose, overnight at 37°C on dishes in antibiotic 
containing media. Individual colonies are picked and 
characterized by diagnostic PCR and automated sequencing 
to confirm the presence of CL in the library. Frozen 
glycerol stocks of rVLl-110ACSRl-31ib are made and 
designated rVLCL. lib .bact (Fig. 7) . 

The above detailed reactions where double amino acid 
randomization occurs within each CSR theoretically allows 
the construction of 2000, 4 00 and 2400 different CSR 
LI ,2,3 respectively, and a rVHlib size of 1.92 xlO*. This 
exceeds the largest published recombinant VL library made 
by similar (Griff iths, Williams et. al. 1994) technology by 
about 2 fold. 

Ix - Construction of the Cons tant regions of ABvyv 

The constant region (C) of the light (CL) and heavy 
chain (CHI) region for the selected parental Fab ABxxx 

(Fig. 9) is obtained either by annealing and ligating a 
series of synthetic overlapping oligonucleotides, as done 
for the V regions, or via standard PCR of the C regions of 
ABxxx or any other antibody mRNA or DNA with identical C 
regions. Nucleic acids encoding specific antibodies may 
be obtained from hybridmas from various sources including 
the ATCC. In either case, the constructions includes the 
removal of endogenous restriction sites that interfere 
with library construction and the creation of a number of 
convenient restriction sites at and around the 5' and 3' 
ends of the C regions so as to allow simple cloning into 
pCLONAL, pEXPRESSION and pV(H or L) ACCEPTOR (Fig. 9). 
For both CHI and CL regions, the C genes have inserted 
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within them an rs3 site for specific joining of v and C 
sections of rVL at or about the natural V/J gene junction 
for heavy and light chains. These sites are referred to 
as either rsJCHLNK and rsJCLLNK respectively. In 
constructing the C sections, these two junctional rs are 
5 appended by standard PCR using BCK primers CLBCK and CHBCK 
and FWD primers CLFWD and CHFWD (see Primer Table for 
sequence details (Fig. 10) . 

The parental C nucleic acid sequence of ABXXX is 
amplified by PCR with Taq polymerase using primers CLFWD 

10 and CLBCK which places the rs3 restriction site within 

the JC segment of the parental Fab at the 5' end of the C 
sequence and two stop codons (TAA) and the rs4' site 
(AscI) just outside the 3' -end of the C region. The 
reaction mixture (50 /tl) is cycled 25 times (94°C for 1 

15 min, 60°C for 1 min, and 72°C for 1 min.) and the 

amplified appended C sequence is purified using Magic PCR 
Preps (Promega) and resuspended in 50 jil water. 

The reaction amplifying the parental Fab CHI gene of 
ABXXX is identical, except for the following: the primers 

20 for the PCR reaction are different, being CHFWD and 

JCHBCK, and the CHFWD primer contains a Notl site at the 
3' terminus of the CHI region. 

To complete construction of the VLCL, the amplified 
and J appended recombinant VL diversified CSRl and 2 and 3 

25 (rVLCSRl&2&3) genes are joined to the amplified CL gene 

int he standard ligation fashion used above, or using PCR 
(Horton, Hunt et al . 1989). Assembly PCR reactions (25^1) 
use Taq polymerase, 1 /il amplified parental JC, and 0.8 fil 
of the rVL.lib gene from above. The appropriate VLBCK 

30 primer is used together with the CLFWD and the reaction 

cycled 30 times (94°C for 1 min, 60°C for 1 min. and 72°C 
for 2 min. ) . 

X. CONSTRUCTION OF rVHCHl . 1 ib ( Fig . 8 ) 

In the oligonucleotide phase (step A) , construction 
35 of a 5' and 3' half of the VH region is accomplished using 
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16 synthetic oligonucleotides, comprising 8 compl ementary 
pairs. Six oligonucleotides are for the 5' half and are 
labelled VH a-c with their complementary partners labelled 
VH a'-c Within the 5'VH half, the oligonucleotide b/b' 
pair has the rsB restriction site between amino acids rVH 
22-26. Ten oligonucleotides are for the 3' half and are 
labelled VH d-f and d'-f. Construction of the 3' half of 
the VH region is done in a similar fashion but uses three 
forms of the »e" complementary pair, designated as follows 
VH e/e', VH e2/e2' and VH e3/e3'. These correspond to the 
"e" oligonucleotides with either a valine (V), alanine (A) 
or arginine (R) at amino acid position VH71, respectively. 

In the annealing step, three types of the 3'VH half 
are constructed: 3'VHdef/d'e'f ' , 3'VHde2f/d'e2'f ' and 
3'VHde3f/d'e3'f . The variance in -e- oligonucleotides 
within the 3'VH half is necessary to allow for subsequent 
construction in the rVHlib of all four of the known CSRH2 
as outlined below. All oligonucleotides are synthesized so 
as to have a least one overlapping complementary sticky 
end, an absence of hairpin forming ends, and an absence of 
complementary sequences other than those of the desired 
oligonucleotide joining partner based on analysis by a 
commercially available oligonucleotide primer analysis 
software. 

Construction of 5' half of i-h«> vh R M i nn 
For constructing the 5' half of the VH region, the 
appropriate oligonucleotides are phosphorylated and are 
mixed together in one reaction mixture, after which they 
are heated and are annealed and ligated together using 
generally known molecular biology technology (Sambrook, 
30 Fritsch et al . 1990). As outlined, the first phase 
annealing and ligation (step A, Fig. 8) allows the 
formation of the 5' VH abc/a'b'C pair. in the next step 
(step B), the correct construct of 5' VH, containing a 
convenient rsB within its b/b' segment, is amplified with 
primers 5'VHFWD and 5'VHBCK (a list of names and sequences 
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for primers used in VHCHl.lib construction appears in the 
Primer Table, Fig. 10) by carrying out PCR on an aliquot 
of the ligated and isolated abc DNA duplex product of step 



A* In this step, an aliquot from the step A reaction is 
amplified using the above noted primers and Taq polymerase 
5 in 50/a1 reactions and is cycled 25 times (94°C 1 min, 60°C 
for 1 min # 72°C for 2 min.) . The amplified DNA is 
purified using Magic PCR Preps (Promega) and is suspended 
in 5 /il water. 

Next, the product of the amplification reaction 
10 having the correct size and designated rVHl-51, is cut at 
rs4 (Notl) and prsl. The cut fragment is purified by 
Magic PCR Preps (Promega) and 1 **g is ligated in a 60 pi 
volume with 1200U of T4 DNA ligase (New England BioLabs) 
to 5 of rs4 and prsl digested pCLONALL (Sambrook, 
15 Fritsch et al. 1990) . DNA is purified from the ligation 
mixture using Geneclean II (BiolOl) resuspended in 30 /il 
water and electroporated (Dower, Miller et al. 1988) into 
E . coli which is then grown in broth containing 1% glucose 
for 1 h and plated into antibiotic containing media. 

20 After overnight incubation at 3 7°C, individual colonies 
are picked and identified. Transf ormants containing the 
recombinant parental 5'VH half, rVHl-51, are identified by 
diagnostic PCR for appropriate size (with plasmid primers 
pCFWD and pCBCK) . Those transf ormants suspected of 

25 containing the rVHl-51 are expanded.. The nucleic acid 

amplified with PCR using PCFWD and.pCBCK are sequenced via 
automated ABI sequencing with commercially available kits 
as outlined by the manufacturer (ABI, USA) to confirm the 
identity of the rVHl-51 fragment. Cultures are then grown 

30 and stored as frozen glycerol (15%v/v) stocks and 
designated rVHl-51bact. 

In the next step, step C (Fig. 8), a diversified 
version of each of the four known CSRH2 is appended to 
rVHl-51. This process is done in four separate standard 

35 PCR reaction mixtures (see above) . Each reaction mixture 
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comprises the rVHl-51 fragment (obtained from toothpicked 
frozen glycerol bacterial stocks) , one of four FWD primers 
(H2.1FWD, H2.2FWD, H2.3FWD and H2.4FWD) and the BCK primer 
H2ALLBCK. The four FWD primers are constructed to span 
from amino acid 47 through 59 of CSR2 lu and contain amino 
acid diversification at position 53. The four library 
products, are isolated, and are cut at rsB and prs5, and 
then 1 /ig of each purified DNA product is ligated using 
T4DNA ligase to 5 /xg pCLONALL precut at rsB and prs5. 

As described above, the ligated DNA is purified and 
used in step D to transform E. coli via electroporation. 
Trans fozmants are isolated and characterized first by 
diagnostic PCR and then by automated sequencing to contain 
appropriate examples of the randomized diversified 
versions of all four CSRH1. Frozen stocks of each, 
designated rVHrsB-59CSR2 . 1-4 lib. bact. are made. 

Construction of the 3' Hal f of the VH Region 
In a parallel fashion, another set of reaction steps 
A-C are conducted to construct the 3' half of the VH 
region which incorporates nucleic acid encoding amino 
acids 57-95 of the variable heavy (VH) chain (Fig. 8) . 
The oligonucleotides for this reaction contain the three 
sets of pairs of VH oligonucleotides, e/e'and e2/e2', and 
e3/e3' in which amino acid VH71 is valine, alanine or 
arginine respectively. Appropriate mixing (as outlined 
above) allows for annealing and ligation of the three 
different rVH57-95 double stranded complementary 
oligonucleotides 3'VHdef/def (i.e., VH57-95 [71V] ) and 
3'VHde2f/d'e2'f (i.e., VH57-95 [71A] ) and 3 ' VHde3f /d' e3 ' f 
(i.e., VH57-95 [71R] ) . Aliguots of these three reactions 
are then amplified and appended with rsD and prs5 sites in 
step B by PCR using 3'VHFWD and 3'VHBCK. These reactions 
contain Taq polymerase, as described above, and are cycled 
25 times (94°C for 1 min, 60°C for 1 min, 72°C for 2 min) . 
The correct products are purified using Magic PCR Preps 
(Promega) , suspended in 50 M l water and are then cut at 
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prs2 and prs5 and reisolated. Approximately 1 fig of the 
reisolated rVH56-95 gene fragment is ligated into 5 /xg 
pCLONALL precut with prsl and prs5. Plasmid pCLONALL with 
the rVH56-95 insert is isolated and purified using 
Geneclean II (BiolOl) , and is used in step C to transform 
E . coli by electroporation (Dower, Miller et al. 1988) . 
Transf ormants are selected, and the correct three 
products, rVH56-95 :71V;A;R, are identified by diagnostic 
PCR and confirmed by automated ABI sequencing. Frozen 
stocks of each, designated rVH56-95 [71V; A; R].bact. are 



Completion of construction of the nucleic acids 
encoding the four known CSRH2 regions genes is 
accomplished in steps D and E. The three 
rVHrsD-56-71*-95-prs5 inserts, freed by digestion of 
15 plasmid DNA are ligated to the four rVHrsB- 59CSR2 . 1-4 .lib 
which have been precut at rsD and prs5. The resultant 
rVHrsB-95CSR2.1-4 library is cloned into E. coli using the 
standard purification, ligation and electroporation 
processes outlined above. Transf ormants are isolated and 
20 about 50 are characterized by diagnostic PCR and 20 by 
automated sequencing to confirm that they contain the 
expected diversified versions of the four known CSRH.\ The 
ligation combinations of rVHrsB-59 CSR2 and rVH56-7l*-95 
necessary to construct the fully diversified rVHCSR2 
library are rVHrsB- 59CSR2 . llib. with rVH56 -95 : 71V; 
rVHrsB-59CSR2 .21ib. with rVH56-95 : 71A; and rVHrsB- 59CSR2 . 3 
and 2.41ib. with rVH56-95:71R in steps D and E. 

Step F, comprises sequential PCR reactions to append 
to the 3' end of the four diversified CSRH2 constructs 
rVHRSB-95CSR2.1-4 diversified CDRH3s of different lengths 
and the convenient JCH1LNK restriction sites (i.e., rs3) , 
and at their 5' ends diversify their parental CSRHl and to 
append nucleic acids encoding VHl7-rsB-24. The final PCR 
products of these reactions are designated 
rVH17-118CSRl&2&3 .lib and contain all combinations of the 
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diversified known CSRH1 & 2's and diversified CDRH3 of 
three different lengths. 

These steps are carried out in the following 36 PCR 
reactions. Nine aliquots of each of the four different 
toothpicked frozen glycerol stocks of 

rVHrsB-95ACSR2.l-4lib.bact. are added to separate SO pi 
primary PCR reaction mixtures containing Taq polymerase. 
The forward primers H3.5FWD, H3 . 7FWD and H3.10FWD are 
added to 3 of the 9 tubes containing each of the four 
CSR2s.bact. To each triplicate set of unique forward 
primers is added one of the following: the BCK primers 
H1.1BCK, H1.2BCK, or H1.3BCK. These primary PCR reactions 
are cycled 25 times (94°C for 1 min., 60°C for 1 min. and 
72°C for 2 min.). Following completion of the primary PCR, 
aliquots of each of the 36 reactions are taken for a 
secondary PCR reaction with new Taq polymerase, and 
primers H31FWD and H31BCK. The secondary reactions append 
VH100-rs3-118-rs4 and VH17-rsB-24 to the 3' and 5' ends 
respectively. The products are designated 
rVHl7-ll8CSRl23.1ib. followed by a combination number 
(e.g., 1.1x2.2x3.5) which denotes the combinatorial 
arrangement of the three CDRHs in these products. Each of 
the 36 library products are characterized by diagnostic 
PCR and sequence analysis. Aliquots of the 36 libraries 
are pooled to generate the rVH17-ll8CSRl&2&3.1ib. 
25 in step G, DNA from the rVH17-ll8CSRl&2&3 library is 

digested with rsB and rs4. The digested DNA is purified 
using Magic PCR Prep (Promega) ligated into pCLONAL cut 
with rsB and rs4 , purified and used to transform E. coli 
as detailed above. The transf ormants are isolated, 
30 characterized and designated rVHrsB-118CSRl&2&3 .lib.bact . 

In step H the rVHrsB-rs3 inserts are removed from the 
DNA of the rVHrsB-H8&2&3.1ib using restriction enzymes 
specific for rsB and rs3 to form fragments designated 
rVHrsB-H4CDRl&2&31.3. These fragments are ligated using 
35 T4DNA ligase (New England BioLabs) to 5 fig rsB and rs3 
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digested rVHl-51-rs3 .bact . DNA. The product is then 
isolated, purified and used to transform E. coli to 
generate rVHl- JCHLNK-ACSR1&2 &3 lib. bact . Individual clones 
from the library are then isolated and their sequence is 
confirmed by diagnostic PCR and sequencing. The library 
5 is then stored as frozen glycerol stocks . The bacterial 
transf ormants containing this library contain the 
canonical CSRH1 and H2 regions diversified in greater than 
one amino acid position, and CDRH3 of three different 
lengths and diversified in greater than one amino acid 

10 position. This procedure gives at least 10 5 

transformations which are identified by diagnostic PCR and 
sequencing to contain appropriately randomized amino acids 
at the diversified positions within the CSRH2 and H3 
regions for the rVHl- 114CDR2- 3 .library . 

15 m step I, S pg of the rs2 and rs3 precut pVLACCEPTOR 

DNA (also referred to as pVH-CH f Fig. 9) is ligated to the 
rs2 and rs3 released insert rVHl- JCHIiNKDCSRl&2&3 . lib DNA 
(also referred to as rVHlib, Pig. 8) , and the recovered 
purified product is designated rVHCHl.lib. This 

20 rVHCHl.lib product is used to transform E. coli to 
generate a frozen stock of bacteria containing the 
rVHCHl.lib. Greater than 10 6 total members are obtained. 
XI. VH and VL library sizes; 

The above detailed reactions where two amino acid 

25 randomizations occur within each CSR theoretically allows 
the construction of 1200, 1600 and -12 00 different CSR 
HI, 2, 3 respectively, and a rVH library size of 2.3 xlO 9 . 
This exceeds the largest published recombinant VHCH1 
library made by similar technology (Griff iths, Williams et. 

30 al., 1994) by only about 2 fold. A smaller rVH library 
can be made using only 2 randomizations within the CSRH1 
and H2 and one randomization within each of the three 
differently sized CDRH3 . This procedure theoretically 
allows the construction of 1200, 1600 and 60 different CSR 

35 HI, 2, 3 respectively, and a rVH library size of 1.152 x 
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10*. This is similar to the largest rVHCHl library 
reported. The procedure outlined below allows subsequent 
pairing of individual members of such sized rVHCHl 
libraries with individual members of equally sized rVLCL 
libraries {i.e., of 10* as noted above and Fig. 4) on one 

5 

piece of DNA in single bacteria. Based on the sizes of the 

rVHCHl library and rVLCL library that are generated above, 

the potential size of the combinatorial rVab.lib (i.e., 

VHCHllib x VLCL lib) is greater than 10 u members (Fig. 4). 

XII. Construction of the rVab.lib (the VHCHllib 
10 x VLCLlib combinatorial l ib. ) (Fig . 11 . 12 . 14) 

In this section the phagmid (fd</>) which carry the 
rVHCHllib, designated Lox Receiver (LoxREC) ( f d0RECEIVER , 
Fig. 11) and the plasmid (p) which carries the rVLCL 
library, designated Lox Provider (LoxPro) (pUCl 9 PROVIDER, 

15 Fig. 11) are constructed and then are randomly recombined 
in vivo within individual bacteria onto a single phage 
vector (fd<£ CARRIER) which expresses the rVab rCHCHl and 
rVLCL genes and produces on the surface of the phage 
functional versions of the rVab rVHCLlrrVLCL proteins. 

so The rVab library construction phase is outlined in Figs. 
11, 12. 

Construction is begun by reamplif ication of the 
rVHCHl library maintained in the pVLACCEPTOR . lib.bact . 
using PCR, as described above, with primers pCFWD and 

5 pCBCK. The DNA product is isolated and cut with VHrs2 

(Ncol) and VHrs4 (Notl) and is ligated using T4 ligase and 
standard methodology into LoxPRO precut with Ncol and 
Notl. The LoxPRO used in this example is fashioned after 
fdDOGl-21oxVkdel as described by Griffiths, A.D. et al . 

5 1994) and contains an endogenous VHCH1, bounded by a Sfil 
and Notl rs, preceded by a ribosome binding site (rbs) , an 
in frame LpelB leader sequence (LpelB) , followed by an 
inframe wild type loxP sequence (Hoess et al . 1982) and 
then an inframe gpIII sequence. In LoxPRO, upstream from 

► the endogenous VHCH gene, and to be replaced by the 
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incoming rVLCL. lib. there is an endogenous CL gene which 
is preceded leader sequence which ends in a ApaLl in frame 
sequence which is followed by two terminator triplet 
codons. The endogenous CL gene is followed by two 
terminator triplet codons, an Ascl and Hindlll restriction 
site, and a mutant 511 loxP site (Hoess et al. 1986). DNA 
from the ligation mixture is purified and electroporated 
(Dower, Miller et al. 1988) into E. coli TGI (Gibson 1984) 
to create the pUC based library LoxPRO.rVHCHllib. (i.e., 
pUCLoxPROVIDER-rVHCHllib) . More than 10 1 clones are 
obtained and the diversity is confirmed by sequencing 
independent clones . 

In parallel, DNA is purified from the rVLCLlib.bact . 
(Fig. 8) and amplified by PCR with primers pCFWD and 
pCBCK. The PCR product is isolated, cut with VLrs2 
(ApaLl) and VLrs4' (Ascl) and ligated using standard 
methodology into fd based LoxREC (i.e., f dfDOGRECEIVER) . 
DNA amplified by PCR is purified using Magic PCR Prep. 
The DNA is then cut with ApaLl and Ascl and the digested 
DNA (about 6 fig), is purified on a 1.5% low melting-point 
agarose gel using Magic PCR Prep (Promega) . Approximately 
1 fig of the purified and cut rVLCL.lib DNA (Pig. 7) is 
ligated to about 5 §ig of digested f dDOG-21oxVkdel 
(Sambrook, Fritsch et al. 1990) in a 60 pi volume with 
1200U of T4 DNA ligase (New England Biolabs) (Fig. 11). 
Ligated DNA is purified from the ligation mixture using 
Geneclean II (BiolOl) , resuspended in 30 pi water and 
electroporated (Dower, Miller et al. 1988) into four 50 /*1 
aliquots of E. coli TGI cells grown in 1 ml 2 x TY broth 
containing 1% glucose for lh. Cells are then plated in 
dishes (Nunc) in TYE (Miller, 1972) medium with 12.5 /ig/ml 
tetracycline (TYE-TET) . After overnight incubation at 
37°C, colonies are scraped off the plates into 7 ml 2 x TY 
broth (Miller, 1972) containing 15% (v/v) glycerol for 
storage at -70°C. 
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The frequency of inserts is determined by PCR for 
each of the pools. Sequence diversity is confirmed by 
sequencing 8 clones of each pool. The pools are then 
combined to create the rVLCL.lib fdDOG-21ox rVdLlib. 
outlined above. DNA from the ligation mixture is purified 
^ and electroporated (Dower, Miller et al. 1988) into E . 
coli TGI (Gibson, 1984) to create the library 
LoxRECrVHCHllib . (i.e., pUC19-loxrVHCHllib) having greater 
than 5 x 10* clones. Diversity is confirmed by sequencing 
30 independent clones. 
10 Step 4: In vivo recombination of VHCH1 and VLCL genes 

In this step, summarized in Fig. 14, VHCH1 and VLCL 
genes are recombined in pairs, onto single pieces of DNA 
to make the rVab library. Individual members of the VLCL 
and rVHCHl library are placed within a single bacteria via 
15 sequential incorporation within that bacteria of the rVLCL 
member via phage mediated infection and of the rVHCHl 
member via DNA-mediated plasmid transformation. Once 
inside the bacteria, the two chains are combined onto the 
same piece of replicating DNA (fd^CARRIER) within the 
20 bacterium by the Pi CRE recambinase, provided by Pi phage 
infection, which catalyzes recombination at loxP site in a 
process termed ' re combinatorial infection' (Waterhouse, 
Griffiths et al. 1993). The process of recombinatorial 
infection for expressing recombinant proteins was 
25 originally described by Sternberg and Hamilton (Sternberg 
and Hamilton 1981); and Hoess et al. (Hoess, Ziese et al. 
1982; Hoess, Wierzbicki et al. 1986) which are 
incorporated herein by reference and depicted in Fig. 14. 
In the process according to the invention, only those 
30 bacteria transformed with a rVHCHl/rVLCL combination 

(i.e., an rVab member) survive. Given the size of the 
rVHCHl library (greater than 10 1 , see above) and the rVLCL 
library (greater than 10 1 , see above) , this type of 
combination, given unlimited bacteria, could yield a 
35 rVab.lib of greater than 10 17 members. 
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According to the invention, the diversified rVLCL.lib 
is cloned into a tetracycline* fd phage (1st antibiotic 
resistance) containing any VH chain which is easily 
recognized and which will be replaced later in the process 
by rVH.lib chains. The diversified rVHCHl chains are 
cloned into provider ampicillin resistant plasmids (2nd 
antibiotic resistance) . The two libraries are then joined 
in E. coli via phage infection with fd phage containing 
the receiver VLCL chains (the rVLCL.lib) of bacteria 
previously transformed with plasmid DNA containing 
provider VHCH1 chains. A 1 liter culture of these 
bacteria is then co- infected with fPl which is 
chorampenicol resistant (3rd antibiotic resistance) 
carrying the Cre recombinase. fd phage recovered from 
expanded colonies resistant to the antibiotics are used to 
infect E . coli . The percent of receptor phage with 
acquired rVHCHl genes from the provider vector is expected 
to be greater than 5% based on the assumption that each 
bacteria generates 60 phage after overnight culture 
(Griffiths, Williams et al. 1994) . It is also estimated 
that as long as this percent of the original triantibiotic 
resistant recovered cells acquires a rVHCHl chain from the 
provider vector, the number of different phage within the 
rVab library will be close to the number of surviving 
bacteria. 

Details of the Individual Steps for Expressing the 
rVLCL.1.6 and rVHCHl. L.b by CRE-LOX RECOMBINATORIAL 
FORMATION _ 

Phage PI ly sates are made by thermal induction 
(Rosner, 1972) . E. coli C600 Su- (Appleyard, 1954) 
containing phage PICm cl.lOOr-m- ( Yarmolinsky, Hansen et 
al. 1989) are grown in a 2 1 baffled flasks containing 1 1 
of 2 X TY, 25 fig/ml chloramphenicol, 10 mM MgS0 4 with 
vigorous shaking at 30 *C to an optical density of 0.6 at 
600 nm. The temperature is then raised quickly to 42 *C by 
shaking in a 70 *C water bath. Shaking is continued for 
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another 35 min. and then at 37 *C until lysis is visible. 
Cultures are centrifuged to remove debris and intact 
cells. Chloroform (100 §tl) is added to the supernatant 
and PI phage after 30 min. 30 *C infection of midlog E. 
coli TGI (Gibson, 1984) grown in 2 x TY broth with 5 mM 
CaCl 2 . Phage infected E. coli are tittered by plating E . 
coli on TYE medium (Miller, 1972) containing 30 /ig/ml 
chloramphenicol. Resistant colonies are counted after 24h 
incubation at 30 *C and when expressed as transducing units 
(t.u.) are greater than 10 9 /ml. 

One liter of 2 x TY broth containing 12.5 $tg/ml 
tetracycline (2 x Ty-TET) is inoculated with 10 9 E. coli 
carrying the rVLCL.lib cloned in LoxREC (i.e., fdD0G-21ox 
Vkldel Griffiths, A.D.,et.al. 1994). The culture is 
incubated for 12h at 30 *C in two 500 ml aliquots in 2 1 
baffled Erlenmeyer flasks. Polyethylene glycol is added 
to precipitate the phage (McCafferty, Griffiths et al . 
1990), which are then suspended in PBS (phosphate buffered 
saline: 25 mM NaH 2 P0 4 , 125 mM NaCl, pH 7.0) and filtered 
through a 0.45 fim sterile filter (Minisart, Sartorius) . 
The resulting phage, are tittered on mid- log E. coli TGI 
(30 min, 37 # C) and plated on TYE-TET, (Grif f iths , A.D. , 
et.al.,1994) reaches ~10 10 t.u. /ml. 

The recombination process is monitored by withdrawing 
aliquots of the phage infected bacteria and serially 
diluting the bacteria onto TYE plates supplemented with 1% 
glucose and a variety of the three antibiotics, ampicillin 
(100 fiq/ml) , tetracycline (15 Mg/nil) and chloramphenicol 
(30 /*g/ml) and calculating the library size. The rVHCHl 
library cloned into LoxPRO (i.e., pUC19-211oxVHdel in 
Griffiths, A.D., et al . 1994, see above) and contained in 
about 10 9 P r Coli, is inoculated in 100 ml 2 X TY broth 
containing 100 /xg/ml ampicillin and 1% (w/v) glucose (2 x 
TY:AMP:GLU) . An aliquot is taken for c.f.u titering and 
the remainder of the culture is grown overnight at 30 # C. 
A second aliquot is then taken for c.f.u. titering and one 
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5 ml aliquot is used to inoculate 500 ml of 2 x TY : AMP : GLU 
in a 21 Erlenmeyer flask and the culture is grown at 37 'C 
to an OD of 0.5 (600 nm). To this culture, 2 x 10 12 t.u. of 
rVLCL.lib in LoxREC is added and the culture is then 
divided into 5 x 100 ml aliquots. Each aliquot is mixed 
with 1 1 of 2 x TY:AMP:GLU f prewarmed to 37 # C, and 
incubated at 37 *C without shaking for 30 min, and then 
with shaking until they reach an OD600 of 0.4 (about 30 
min) . Aliquot s are then taken for c.f .u. titering. Two 
hundred ml of phage PICmcl - lOOr-m- lysate (about 6 x 10 n 
t.u.) are added to each flask (at an m.o.i. of about l) 
after the addition of CaCl 2 to obtain a final ^ 
concentration of 5 mM in CaCl^. This incubation is 
continued, with short durations of shaking every 15 min. 
for Ih at 30 *C, followed by the centrifugation at 5,000 x 
g for 15 min. The resultant pellets are suspended in 5 1 
2 X TYB (the original volume) with 100 /*g/ml ampicillin 
(100A), 12.5 ftg/ml tetracycline (12. 5T) and 25 jig/ml 
chloramphenicol (25C) and 1% glucose (1G) . An aliquot is 
taken for c.f .u. titering and the library size (number of 
ATC resistant c.f .u.) is confirmed to be greater than 10 10 . 
An aliquot is centrifuged at 12,000 x g for 5 min. the 
supernatant filtered through a 0.45 fim sterile filter, and 
the fd phage titer is determined by infecting log phase E^ 
coli TGI (30 min. 37 *C) and plating on TYE-TET. 

The culture, in 5 x 1 liter aliquots, is incubated 
overnight at 30 *C (all culturing is with shaking unless 
specified) for 24h in 2 1 baffled flasks. Aliqouts are 
taken for bacterial c.f.u. and fd phage (using log phase 
E . coli TGI) titering with the total yield of fd phage 
being confirmed to be greater than 10 13 t.u. The culture 
is centrifuged at 5,000 x g for 15 min. at 4*C and the fd 
phage are precipitated using PEP (McCafferty et al . 1990) 
and resuspended in a final volume of 10 ml PBS. 

Five 2 1 flasks, each with 1 1 2 x TYB, are 
inoculated with E. coli TGI and grown at 37 *C until 
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reaching an OD600 of 0,4 (about 4 x 10 n bacteria) . About 
1-2 x 10 12 t.u. rVab are then added to the 5 1 of E. coli 
and the cultures are incubated without shaking at 37 *C for 
30 min. The number of E. coli infected with fd phage is 
confirmed by plating bacteria on TYE-TET plates to be 
greater than 10 12 . Tetracycline (12.5 jig/ml) is then added 
and the culture is grown for 16h at 30 # C. The culture is 
then centrifuged at 5,000 x g for 10 min. and the pellet 
comprising the library is suspended in 250 ml 2 x TYB 
containing 15% glycerol and is stored in 15 ml aliquots at 
-70 # C. 

The efficiency of replacement of the endogenous VH to 
be exchanged in the phagemid receiver vector LoxREC with 
rVHCHl chains from the provider vector LoxPRO (i.e., 
pU19-21oxVHlib) (Griffiths A.D. , et .al . , 1994) , is determined 
to be less than about 20% by analyzing 200-300 individual 
colonies from the rVablib. Colonies are transferred onto 
TYE-TET plates and grown overnight at 30 *C. 
Identification of colonies possessing the recombinant VH 
genes is accomplished using colony hybridization 
(Tomlinson et al. 1992) with a primer complementary with 
the CDR3 region of the exchangeable VH of the LoxREC. 
Between 40-50 clones lacking the endogenous VH gene (i.e., 
the antiTNF VH as used in fdDOG-21ox Vdel by Griffiths, 
A.D. et al., 1994) are screened by PCR (Gussow and 
Clackson, 1989) for the presence of heavy chains with the 
primers similar to PELBBCK (5'GAA ATA CCT ATT GCC TAC GG) 
and CHI . LIBSEQFWD (i.e., 5'GGT GCT CTT GGA GGA GGG TGC) 
and for the presence of light chains with the primers like 
fdBCK (5'GCG ATG GTT GTT GTC ATT GTC GGC) and CL. (or 
CD LIBSEQFWD (respectively, 5'CAA CTRG CTC ATC AGA TGG CG 
OR 5'GTG GCC TTG TTG GCTTGA AGO (Grif f iths , A.D. , et al . 
1994) . Both chains are expected to appear among the 
clones at frequency of about 20-30%. 

Aliquots are then spread on TYE-TET in dishes (Nunc) , 
and are incubated overnight at 30 *C as well as being 
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tittered by serial dilution on small TYE-TET plates to 
allow determination of the number of colonies on the large 
plates. The plates containing the necessary bacteria to 
generate 10 7 clones are accumulated, and the bacteria are 
scraped into 10 ml 2 x TYB containing 15% glycerol to make 
5 stocks corresponding to rVab libraries of greater than 10 7 
clones . 

XII. Step 5 - Generating Phage and Displaying 
the rVa b.lib on Phage Surfaces (Fig. 14) 

I0 As constructed above, each phagemid carries and 

expresses an individual member of the rVab.lib. As shown 
in Fig- 14 , VHCH1 protein is expressed as a fusion protein 
coupled in open reading frame to the NH2- terminus of the 
fd gpIH coat protein gene and is therefore displayed on 
the mature phage surface as an attached surface protein. 
The VLCL protein, expressed via appropriate leader and 
double terminator codons as a soluble protein, is released 
into the bacterial periplasmic space wherein under 
reducing conditions it spontaneously forms active 
disulfide linked dimmers with VHCH to produce the desired 
functional recombinant rVab on the surface of the mature 
phage. Phage lysates expressing the entire combinatorial 
rVab library (one rVHCH and one rVLCL gene per phage) are 
made with the aid of helper phage. 
25 Phage, helper phage, plasmid construction, and 

titering are as generally described in the literature and 
phage and helper phage are available from commercial 
sources (Stratacyte CA, or Cambridge Antibodies 
Technologies, UK) . The lysates are in general made as 
follows: five 1 of 2 x TY-TET is inoculated with a 15 
(5-20) ml aliquot of the rVab phage library (greater than 
2 x 10 10 c.f-u.), the cultures are grown overnight at 30*C 
in baffled flasks (1 1 medium/fl) , centrifuged at 5,000 x 
g for 15 min at 4'C and the fd phage are precipitated with 
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PEP (McCafferty et al . 1990). Phage is then resuspended 
in a final volume of 10 ml PBS. 

These lysates are designated rVab.lib.F and have total 
yields of rVab expressing nature phage of from 10 13 to 10 14 
t .u. 

EXAMPLE 2 

Preparation of SOMERs For The Human Type 1 
Muscarinic Acetylcholin e Receptor 

In this example, following Stages I and II of the TSA 
process (Fig.l), rVabs from the rVab.lib are identified, 
isolated and used to establish an assay for small organic 
molecules (SOMER) which bind to and regulate the activity 
of only one subtype of human muscarinic cholinergic 
receptor (huAChRm) . Such SOMERS are useful new discovery 
leads for such diseases as Alzheimer's and other memory 
and learning deficits. The steps outlined below 
constitute Stages I- II (see Fig. 1) of the process of the 
invention and are those necessary to isolate from the 
rVab.lib those rVab members which bind (T+) to type 1 of 
the AChRm subtypes, regulate its activity (A+) , and are 
specific and selective (S+) for subtype 1 of the human 
muscarinic receptor (huAChRml) . Stage III of the 
invention, using these TSA+ rVabs to generate 3D models of 
ACHRml- specific pharmacophores (BEEPS, see below) and 
obtain SOMERs is briefly outlined at the end. 

Stages I -II detail the steps necessary to obtain and 
use the specific AChRml rVab to establish simple rapid 
radioreceptor assays for small organic molecules (SOMERs) 
which specifically bind and regulate huAChRml. As 
disclosed herein, and illustrated in Fig. 18 and 19, 
these rVabs are used to discover active surfaces on the 
huAChRml which are not present on the other huAChRm2-5 
subtypes. In addition, the rVabs may be agonists or 
antagonists at selective huAChRm subtypes (i.e., m,. 3 ) and 
may exhibit specif icity (S+ ) of action between one m 
subtype and the other four. 
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Phase I of this process reconstitutes functional 
hOAChRm which are the target of these assays. Phase II 
first identifies the rVabs contained within the rVab.lib 
which bind to huAChRml (i.e., are T+) , and are selective 
among the five huAChRm subtypes (Andre, Marullo et al. 
5 1987) as well as specific for huAChRm over non- cholinergic 
neurotransmitter receptors. In this example these two 
attributes are referred together as S+. Subsequently, 
Phase II identifies and isolates the subpopulation of TS+ 
huAChRm rVab which regulate the activity of the 
10 huAChRml (A+) with similar TS+ attributes. The rVabs with 

all these attributes are referred to as TSA+ rVabs. Phase 
III converts the TSA+ rVabs to reporters (i.e., 
rVab. reporters) and establishes validated automated rapid 
receptor binding screens for small organic molecules 

15 (SOMERS) which competitively displace active rVab 

reporters from active surfaces on huAChRml. Among these 
SOMERS are those having the desired activity profile of a 
pharmaceutical discovery lead, i.e., selective specific 
regulation of AChRml. 

20 Phase I* A : Obtaining AChRm 

Cortical membranes enriched in huAChmR are prepared 
from brains (fresh or frozen, human, porcine or bovine) as 
outlined by Haga & Haga (Haga and Haga 1983) . Membranes 
are prepared by homogenization in standard fashion (i.e., 

25 with protease inhibitors) and AChRm is solubilized by 
treatment with 1% digitonin, 0.1% NaCholate in 50 mM 
NaCl /buffer. The soluble receptor is purified over an 
3- (2 '-amino benzhydryloxy) tropane (ABT) affinity column 
and is eluted from the ABT column by atropine. Soluble 

30 receptor is subsequently applied onto a hydroxyapatite 

column to remove the free atropine. The receptor is then 
eluted with high potassium phosphate and 0.1% digitonin 
and is further purified through a second round of ABT 
purification as noted above. Two rounds of HPLC 

35 purification over tandem linked TSK4000SW and TSK3000SW 
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columns provides the final purification and the receptor 
is suspended in 0.1 M potassium phosphate with 0.1% 
digitonin. 

As a secondary source, the five huAChRml-5, expressed 
as recombinant proteins ( rhuAChRml - 5 ) in Sf9 cells 
^ containing an expression vector baculovirus construct 
carrying one of the huAChRm as originally described by 
Vasudeva (Vasudevan, Re i lander et al . 1991) are obtained 
from commercial sources (BioSignal, Inc., Montreal, 
Canada) . Other alternative sources of huAChRm are 
10 various tissue culture cell lines transfected and 

expressing cloned huAChRm (Kubo, Fukuda et al. 1986; 
Shapiro, Scherer et al . 1988; Buckley, Bonner et al . 1989; 
Buckley, Hulme et al. 1990; Tietje, Goldman et al . 1990; 
van Koppen and Nathanson 1990; Kashihara, Varga et al. 
15 1992; Beth 1993; Lazareno, Farries et al. 1993; van 
Koppen, and Lenz et al . 1993). 

Phase I-B ; Obtaining the G proteins (GP) 
Go, Gi and Gn (referred to as G protein [GP] in text 
and G in Figures) are purified as described (Sternweis, 
20 1984; Haga, 1986, and Haga, Uchiyama, et .al . , 1989 ) . Brains 
(150g) , porcine, bovine or human (obtained from commercial 
or non-profit sources) are homogenized, the membranes 
pelleted and then solubilized with 1% NaCholate in 20 mM 
TrisHCl (pH 8.0) 1 mM EDTA, 1 mM DTT (l%Cho-TED) with 0.1 
25 mM benzamidine (2L vol.). After centrif ugation, the 

supernatant is applied to DAE Sephacel and the fractions 
binding [ 33 S]GTPS are eluted with linear NaCl, in 
l%Cho-TED, concentrated, and applied and eluted from 
Ultrogel AcA 34 in 0.1M NaCl in Cho-TED. The fractions 
30 with [ 35 S]GTPS binding activity are pooled with TED + 0.1M 
NaCl (450ml) and applied to heptylamine-Sepharose, washed 
and finally are eluted with a linear gradient of 0.25% 
NaCho-TED + 0.2M NaCl vs. 1.3% NaCho - TED + 0 . 05M NaCl. 
This material (a mixture of Gi and Go) is applied to 
35 DEAE- Toyopearl , prewashed with TED + 0.6% Lubrol PX 
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(0.6%LPX-TED) and eluted with a linear gradient of NaCl in 
0.6%LPX-TED. The Gi fractions elute first, then the Go 
fractions. Each is collected separately and is stored at 
-80*C until use. Before use, the Lubrol is changed to 
0.8% NaCholate, in TED+0.5M K phosphate buffer 
* pH7 , 0 . lMNaCl ) on a small column of hydroxyapatite . 

Phase I-C : Reconstitution of an active AChRm ; GP complex 

Reconstitution is accomplished as per Florio and 
Sternweis (Florio, 1985) . Porcine [or human brain total 
lipids: as per Folch, J., Lees, M. , and Stanley, G.H.S. 

10 (Folch, Lees et al. 1957). The lipid mixture is prepared 
(Haga, 1986 ) from brain extract (Folch fraction I) (1.5 
mg each) and total lipids (1.5mg each) suspended in l ml 
HEN (20 mM Hepes-KOH buffer pH 8.0, 1 mM EDTA and 160 mM 
NaCl) with 0.18% deoxycholate and 0.04% sodium cholate. 

15 rhuAChRm (0.2-0.4 nmol/ml [ 3 H] QNB binding sites in PD ( . 5M 
potassium phosphate buffer pH 7.0 and 0.1% digitonin 
(10-40fil) ) are mixed with 0.1 mM oxotremorine in HEN, and 
then with 100 pi of lipid mixture (final vol. 200 pi) to 
give QNB-.R complex. The complex is then run through a 

20 Sephadex G50 column and the void volume (1-8 pmol [ 3 H] QNB 
binding sites, 400 pi) is collected. The huAChRm : QNB 
complex is mixed with G protein (mixtures or separate 
G-proteins, 0-200 pmol of [^SJGTPgS binding sites in 40 pi 
cholate solution) CN-TED and HEN (50 pi) containing MgCl 2 

25 and DTT (final concentration 10 and 5 mM respectively) and 

incubated at 0*C for 1 hr. This huAChRml:GP mixture is 

diluted before use with 3-5 vol of HEN. 

Phase I-D : Attachment of active huAChRm to matrices 
(Fig. 19) 

30 huAChRm (abbreviated AR in text and R or T in 

Figures) , alone or complexed with GP, is attached to a 
Sepharose (or agarose) -type matrix by taking 5 ml of 
matrix (WGA- Sepharose, mmolWGA/ml Sepharose, 50% v/v, 
prewashed and suspended in buffer A (25 mM Potassium 

35 phosphate buffer, [pH7.0],0.8 mM EDTA, 10 mM MgC12, 230 mM 
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NaCl, 0.06% BSA, and 4mM HEPES KOH buffer [pH 8.0]) and 
mixing it with less than 1 ml reconstituted AR:GP 
complexes (100 pmol AR/ml ) . The mixture is then incubated 
at room temp (r.t.) for 30 min # diluted with buffer A to 
20 ml and the Sepharose is allowed to settle (or 
centrifuge at low speed [5,000 x g, 1-2 min] ) . The 
Sepharose is then resuspended in 20 ml buffer A and the 
washes are repeated twice to provide purified AR 
complexed- Sepharose WGA [sWGA: ARGP] material. 
Recombinantly derived or native AR:GP complexes with 
appropriate sugar residues bound to WGA in this process 
remain active as matrix- attached receptor in agreement 
with published data showing glycosylation is not required- 
for AChRm activity (Habecker, Tietje et al . 1993). 
Quantitation of bound AR:GP to sWGA is verified by [*H] QNB 
± 10 jiM atropine and [ 35 S)GPTS or [ 3 H] GppNHp + O.lmM GTPS 
or GppNHp binding using standard binding assays (Berrie, 
Birdsall et al . 1985; Haga, Haga et al. 1986; Wheatley, 
Hulme et al. 1986; Poyner, Birdsall et al . 1989). 

In parallel reactions, AChRm (or GP) , natural or 
recombinantly expressed preparations, are attached by 
standard techniques to plastic, directly or secondarily, 
through matrix attached antibodies, naturally derived or 
rVab-type, which recognize epitopes on the receptor, 
glycoprotein, G-protein or small peptide tags (i.e., the 
c-myc and other amino or carboxy terminal in frame tagging 
peptides, available in various spaced commercial 
expression vectors) . After attachment of AR, the 
unoccupied reactive matrix surfaces are blocked by 
application of various standard blocking agents (i.e., 
BSA, milk etc. ) . 



In this stage, rVabs which possesses TSA+ attributes 
are identified as those which bind to AChR directly or 
indirectly attached to the matrix, with or without G, in 
buffer conditions similar to those used for AChRm 



Phase TI ; Panning for TSA+rVab 
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radioreceptor binding studies. These conditions maintain 
receptor activity- In all cases plastic and not glass is 
used for direct attachment matrix surfaces and reaction 
vesicles to minimize rVab nonspecific absorption to glass. 
The buffer for these reactions is a 10 mM potassium 
5 phosphate (pH 7.0), 0.8 mM EDTA, 10 mM MgCl 2 , 0.230 mM 
NaCI, 0.06% BSA, 4 mM Hepes-KOH (pH 8.0) buffer, and 
optionally further comprising guanine nucleotide (GTP) 
and/or muscarinic agonist or antagonist as detailed below. 
This stage isolates four types of A+ TSA+ rVab antibodies: 
10 agonist like (Ago+) , partial agonist-like (partAgo+) , 
allosterically agonist (Alloago+) and antagonist- like 
(Antago+) (outlined in Fig 19) . 

Phase II-A: Panning for receptor fTaraet (T+)1 recognition 
The general process is summarized in Fig. 16 and the 
15 specific application in Fig. 19: Five ml of the rVab.lib 
(10 nn PFU/5 ml, and suspended in buffer) is mixed with 1.0 
ml settled s-WGA:GAR in buffer A, and incubated at 30 "C 
with gentle shaking for 60 min. The mixture is then 
centrifuged at low speed (LSS) of 500 x g for 15 min. The 
20 supernatant is decanted and diluted with buffer A to 10 

ml. These washes are repeated 3 times rapidly and the rVab 
in the final pellet resuspended in buffer A and designated 
as the T* rVab.lib. (Fig. 19) . Phage are released by 
elution with lOOrriM triethylamine (Marks, Hoogenboom et al . 
1991). Aliquot s are withdrawn and tittered for phage. 
The population of isolated phage are then amplified by 
infection and induction of new lysates and panned again 
2-4 more times to generate the final T+rVab population of 
phage for subsequent isolation of the four types of A+ 
rVabs . 

Phase II-B : Panning for Active rVabs (A +rVab) (Phase IIB) 

In this process (general outline in Fig. 17, 
specific application in Fig. 19), the subset of rVab from 
the amplified T+rVab population which are potentially 
agonistic are induced by the addition of guanine 
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nucleotides to dissociate from the matrix attached R:G 
complex and be isolated as free TA+rVab in the 
supernatant. In this process, the rVab which bind and act 
as antagonists, or bind to nonactive surfaces, remain 
matrix- receptor associated after the addition of guanine 
nucleotide. The negative influence of GTP on T+rVab 
binding is taken as indicative of potential agonist action 
of the bound rVab based on the observation that in 
functionally coupled AR:GP complexes there is a negative 
reciprocal interaction between the binding of GTP or GDP 
to the G protein, and agonist to the receptor, which can 
be observed as an immediate dissociation of either from 
the complex (Smith, Perry et al . 1987; Poyner, Birdsall et 
al. 1989; Lazareno, Farries et al. 1993). No such 
reciprocal interactions occur between antagonist and 
guanine nucleotide binding (Buckley, Bonner et al . 1989). 

The TA+ rVab released into the supernatant are 
further separated and isolated as one of three types of 
agonists in separate panning steps (see below Phase 
IIB-i, ii, iii) . The specific muscarinic activity of the 
rVab is confirmed at the end of all isolations using 
AChRml activity assays in which potential TSA+rVabs (a) 
compete with radiolabelled antagonist (or agonist) , (b) 
dissociate prebound [ 33 S]GPTS or [ 3 H]GppNHp from matrix 
bound AChRm : GP complexes, (c) stimulate GTPase and or GTP 
exchange, and d) regulate the activity of other effector 
systems coupled to the AChRml (i.e., adenylate cyclase, 
phospholipase, K channels) in various published in vitro , 
cellular or animal assay systems (Yatani, Mattera et al . 
1988; Fraser, Wang et al . 1989; Shapiro and Nathanson 
1989; Kobayashi, Shibasaki et al . 1990; van Koppen and 
Nathanson 1990; Weiss, Bonner et al . 1990; Yatani, Okabe 
et al . 1990) . 

In Phase IIB, addition to bound T+rVab of ACh itself 
can also be used, via the same type of induction of rVab 
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dissociation from AChR, to isolate those rVab which bind 
not to the ACh binding pocket but to GP at active 
nucleotide binding surfaces or to other surfaces on AR or 
GP which are active and allosterically connected with the 
cholinergic binding surfaces of the AChRml . 

Specifically, at the start of Phase IIB, the 
amplified T+rVab.lib isolated in Phase IIA is mixed with 
matrix -bound AChRml :G complex, in 10 volumes buffer A as 
noted above for 30 min at 37 *C. The pellet is centrifuged 
at low speed, resuspend in 10 vol cold buffer A and 
immediately recentrif uged. The washed pellet is 
resuspended in 10 vol cold buffer A containing 100 uM GTP. 
After less than or equal to about 1 min. the matrix :AR:GP 
complex is centrifuged at low speed, and the supernatant 
is separated from the pellet to be used to isolate three 
different types of agonistic rVab in Phase IIB-i, ii-iii . 
The pellets are washed in similar fashion with buffer A 
three (3) times and analyzed in phase IIB-iv for 
muscarinic antagonist (Antago-O activity as detailed 
below. Throughout these phases, aliquot s of supernatant 
are taken to titer the phage, and if less than 10 6 /ml, the 
phage are amplified and recycled as above 2-3 additional 
times. To the final supernatant, containing rVab induced 
to dissociated via GTP addition, GTPase and GDPase are 
added and the supernatant incubated 30 min at 30*C. The 
solution is then chilled and passed over a Sephadex G50 
fine column using buffer A and the .void volume, free of 
any remaining nucleotides, is taken and labeled 
TA +CCTP+) rVab.lib. 



The T+rVab lib, for which binding is not modified by 
addition of GTP, arid which is recovered bound to matrix in 
the presence of GTP, is released from matrix and the phage 
harvested by PEP precipitation in Phase IIB-iv. The 
phage are then resuspended and mixed with s-WGA:RG in 2ml 
buffer A containing saturating amounts of antagonist 
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(atropine 10 iM, perenzepine 1 iM, scopolamine, 1 §M) . 
Following incubation for 60' at 30 "C phage and s-WGA:RG 
are centrifuged at low speed and the supernatant is 
collected. The free phage are isolated, and amplified (as 
noted above) and the population recycled an additional 2 
to 3 times by combining with s-WGA:RG to remove from the 
supernatant phage which, in the presence of antagonist, do 
not bind to s-WGA:AR. The phage in the final supernatant 
contain the expressed A+rVab members which are muscarinic 
antagonist -like (Antago+) are designated at the end of 
Phase IIB-iv, as TAntago+ huAChRml rVab.lib. [see Fig. 
19, rVab-4] . 

The pellet from incubation with muscarinic antagonist 
in the above Phase IIB-iv contains a T+rVab sublibrary 
which has members which interact directly with surfaces on 
the G protein of the AR:GP complex and are guanine 
nucleotide like regulators of the AChRml : G complex. Phage 
are freed from the matrix, amplified and incubated with 
matrix bound G- protein in buffer A. The matrix, and 
attached rVab, are then centrifuged, washed and attached 
phage isolated. Confirmation of G-like activity among 
these isolated rVabs is done in standard radioreceptor 
binding assays establishing competition with radiolabelled 
GppNHp or GTP y S for binding to GP. 

Phase IIB-i . ii. iiit Separating GTP Sensitive A+rVab into 
Acro + ( CT ^ CCh " c ) and alloAao^f^^^) AChRml - rVab ( Fia . l Q 

One to 10 ml of the TA +<GTP+) rVab . lib is mixed with 1 
ml sWGA-GR, incubated 60' min, 30 *C in buffer A with 300 
pM stable muscarinic agonist carbachol (CCh) and is then 
centrifuged at low speed. In Phase IIB-iii, the pellet is 
washed with buffer A three (3) times, and resuspended in 
buffer A and the phage isolated in standard fashion. This 
phage population, labelled TA+ (CTP+CCM, rVab . lib, contains 
the allosterically acting muscarinic like agonist 
(alloAgo+) rVab members (Fig. 19, rVab-3) . 
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The supernatant from the above Phase IIB incubation 
with CCh is passed over Sephadex G50 (fine) in Phase 
IIB-i,ii and the phage are collected in the void volume of 
the column (as outlined above ) to obtain CCh free rVab 
which are blocked from binding to AChRm by CCh. These 
phage are labeled as the TA +(GTP+CCH+) rVab. lib and contain 
the Phase IIB-i and ii rVab.lib members which are 
competitive -ACh muscarinic full(i) or partial (ii) 
agonist- like (Ago+) antibodies (i-e., rVab-1 and 2 in 
Fig 19) . 

Phase II-C : Separating Selective (S+) from non 
Selective tS-) TA+ rVabs 

All four types of AChRm A+T+rVab phage isolated in 
Phase IIB (labelled rVab-1,2,3 & 4 Fig. 19), are taken 
separately, and mixed with 1 ml sWGA : GR ml in buffer A 
containing soluble complexes of GP and AChR of subtypes 
2-5 (i.e., G:AChRm2-5 complexes). These complexes are 
added as the competing target peptide (analogous to 
comp-T-pep in Fig. 16) which contain greater than 10 fold 
excess of surface epitopes which are not to be recognized 
by the ml specific A+rVabs, incubated 30 *C, 60 min and 
then centrifuged at low speed ( Fig. 19). s The pellets 
contain the S+rVabs.lib members and these are resuspended 
in 10 vol buffer A and washed immediately. The phage are 
recovered in standard fashion, amplified and cycled 
through Phase IIC two to four additional times. Frozen 
stock bacterial cultures and phage lysates are prepared 
for each of the four A+ types of AChRml specific (S+) and 
are designated TS(Ago; partAgo; alloAgo; or 
antAgo) +rVab. lib. In an alternative embodiment , isolation 
of the AChRml specific rVab library is done on the 
T+ rVab.lib before selecting for the A+ rVab.lib (Fig. 16) 
and the population is amplified for subsequent A+ 
selection as defined above. 
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Stage II-E: Confirmation of A+ activity among 
individual members of the TSA+ rVab AChml lib 

Individual members (10-20) of each of the four A+ 
type TSA+ rVab AChRml library identified above are 
obtained and phage lysates are generated for each by 
standard technology. The A+ profile for individual phage 
members of each of the above four A+ library is confirmed 
and quantitated by a nM ED50 value in one or more of the 
following standard radioreceptor and receptor- coupled 
activity assays* The radioreceptor assays use 1) active 
soluble targets (i.e., AChRm, AchRm: G and G-protein 
complexes); 2) radiolabelled AChRm [ 3 H] agonist or 
antagonist, or [ 3 H, or 32 P]GTP, or GMPPNP or [ 35 S]GTPS in 
buffers used for rVab isolation; and 3) various 
dilutions of individual rVab members to be tested. The 
reaction mixture contents are incubated at 30°C for 30 min 
and the targets are recovered free of soluble radioligand 
by standard filtration or PEG precipitation. The 
reduction in specifically bound radiolabel is then 
quantitated. 

The degree of agonist activity for Ago+, partAgo+ and 
alloAgo+ rVab members is demonstrated by dose response 
alteration of any one of a number of AChRml coupled 
effector systems. Individual antagonism (Antago+) is 
demonstrated by dose response blockage of the ACh agonist 
effect on the particular receptor coupled system. 
Phase III. Con version of Selected A+rVab to rVab 



Reporters 

A. Preparation of Reporters and Competitive 

Binding Assays to Identify SOMERs (Fig. 18. 19) 

30 DNA is isolated from phage lysates prepared from 

bacteria grown from two to five individual TSA+rVab.bact 
stocks from each of the four classes of A+ libraries 
characterized above to have A+ activities with ED50 values 
of 1-30 nM. The DNA is digested with ApaLl and Notl to 

35 release from the fd^Carrier the rVLCL- rVHCHl rVab 
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construct. One |*g of the insert is isolated and mixed 
with 5 fig DNA from pEXPRESSORrVab (pEXPRESSORrVab- 1 , see 
Fig. 9) f precut with ApaLl and Notl, and 1200 U T4 ligase 
(Sambrook, Fritsch et al. 1990) . The ligated products are 
purified and electroporated into E . coli (Dower, Miller et 
5 al. 1988) . Trans formants are grown and characterized by 
diagnostic PCR and then sequenced. Correct constructs of 
each are then grown, the recombinant rVab (i.e., 
VHCH1:VLCL dimmer chains) induced and the rVab products 
are recovered in the supernatant by precipitation with 
10 Sepharose coupled VH or VL chain antibodies or antibodies 
to peptide sequences (IS0TAGS) included in pEXPRESSORrVab - 
I (Fig. 9C) and fused in frame to the carboxyterminus of 
CHI. The rVab are then released from the precipitating 
antibody. The VHCH1 chain of the rVab is then 
*5 phosphorylated in a constant region C terminal domain 

attached in frame (Li, et al. 1989) when rVab is ligated 
to pEXPRESSrVab. The phosphorylation reaction uses 
protein kinase and [ 32 P]ATP following published methodology 
and the radiolabelled product is isolated in the void 
20 volume of a G50 column. The radiolabelled rVab is mixed 
with BSA and stored at -4°C until use. 

To establish a saturation isotherm and ED50 for the 
labelled rVab with its active target (soluble or membrane 
bound; GP, AChrRml, or AChrRml:GP complexes), the binding 
25 of rVab is determined from reaction mixtures (50 pi) 

comprising from 1000-1,000,000 cpm.of radiolabelled rVab 
with and without 1000 folded excess of unlabelled rVab in 
buffer B. Identical control assays are done with 
AChRm2-5, AChRnicotinic, or other noh- cholinergic 
30 G-protein linked neurotransmitter receptors (e.g., 

beta -and alpha adrenergic, and opiate receptor) . These 
assays are incubated for 30 min at 30°C. The 
[ 32 P] rVab: target complex is PEG precipitated (or filtered 
with membrane bound target) and counted for radioactivity. 
35 The induced dissociation of rVab from its target by 
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an allosteric effector (i.e., the Ago+rVabs with GTP) 
defines the class of allosteric rVab agonists. A series 
of competition binding assays is then performed using less 
than, or equal to, the ED50 amount of [ 32 P] rVab with 
increasing concentrations of the nonlabelled form of the 
same rVab, other rVab, standard muscarinic specific 
ligands (agonists and antagonists), and a number of 
noncholinergic ligands as controls to further characterize 
these rVabs. 

These assays establish a saturation binding isotherm, 
an apparent Kd for rVab and target association, and IC50 
values for various ligands and other rVabs . The reactions 
carried out in the presence of increasing concentrations 
of other members of the same TSA+ rVab group define the 
rVab with the lowest IC50 value. This rVab is then 
converted to a radiolabelled form for use in obtaining 
saturation isotherms and various competition curves. In 
addition to the radiolabelled rVab, these assays further 
may contain 1) target agonist; 2) antagonist; 3) GTP; 
and 4) combinations of all three. Standards such as 
nicotine, muscarine, ATP, GMP, and the various small 
organic molecules previously reported in the literature to 
have affinity for regulation of AChRm receptor of the ml -5 
type regardless of affinity or selectivity may also be 
included. Saturation isotherms are generally conducted 
over a concentration range of four to six orders of 
magnitude. 

rVab's with affinity for AChRml of less than about 10 
nM, selectivity for AchRml over AchR types m2-5 of >100 
fold, and specificity regarding non- cholinergic soluble 
receptors of 1000 fold are appropriate as rVab- REPORTERS 
for A+ activity for use in Stages II and III of this 
invention wherein SOMERs are identified in CHEMFILES or 
synthesized based on BEEP models (see below) . 
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In the last three phases of the invention, which are 
part of TSA Stage III , the TSA+ rVabs are grouped 
according to common epitopes and attributes (Phase IV) , 
3D-models of active pharmacophores (BEEPS) are derived 
(Phase V) and the pharmacophores used to find SOMERs in 
existing CHEMFILES or by synthesis (Phase VI) . The 
grouping of TSA+rhuAChRml in Phase IV is accomplished 
according to a) the common surfaces recognized by the 
rVab (defined by competition by peptide fragments of the 
AChR; b) the type of activity exhibited by the rVab 
(partial or full agonist, antagonist, competitive or 
alios teric with ACh or GTP) and; c) the diversified 
amino acids of the V regions found in the rVab. 

The Stage III analysis of the TSA+rVabs which creates 
15 a 3D model pharmacophore (Fig. 23-25) is performed based on 
a genetic algorithm directed comparison of the array and 
positions of the amino acids in the V regions of the 
active rVab's, including CSR, CDR and framework residues- 
The 3D atomic model formulated by this process is 
20 designated a "biologically enhanced ensembled 

pharmacophore" (BEEP) . The BEEP contains sufficient 
information to describe the elements of a SOMER necessary 
for the activity profile of the active rVabs within that 
particular group. 
25 in Phase VI, the BEEP is used in a variety of 

available programs (HOOK, LOOK, and DOCK) for 
computational screening (Phase Via) of available CHEMFILES 
for huAChRml SOMERs and, in a rational drug design effort, 
to direct the actual synthesize of huAChRml SOMERs (Phase 
30 VIb) . SOMERs obtained by either approach are then 

confirmed as TSA+ AChRml agonists or antagonists in in 
vitro , cellular and animal assays, known to those versed 
in cholinomimetics. 

Additional diversification of TSA+ rVabs within CSRs 
35 and CDRH3 is carried out by PCR (as detailed in the 
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construction of the original rVab.lib) in Phase IVb 
whenever the number of rVab within a group is less than 10 
or when sufficient information is not obtainable from the 
number of A+ rVab's identified to develop BEEPS with the 
desired usefulness for identifying SOMERs and 
simplification of the TSA+ population is done when the 
number of rVab within a group is >100 (Fig. 15) 



This example outlines the TSA process establishing 
simple competitive binding assays for multimeric small 
organic molecules, which in this example are DISOMERs, 
capable of regulating the activity of growth hormone 
receptor. Here, D ISOMER discovery is based on the 
discovery of pairs of rVab which identify active surfaces 
on Growth Hormone Receptor and their conversion to 
rVab . REPORTERS according to the method of the invention. 

This methodology establishes a generic approach for 
discovery of drugs active at oligomeric receptor targets, 
or targets requiring activation at multiple sites of a 
monomeric unit. In such systems the "receptor" is defined 
by multiple surfaces which must be in contact with the 
signal to cause activation. 

The process of this invention provides a means of 
identifying active ligands for multiple site receptors a) 
which have more than one active surface; b) more than 
one subunit per active receptor complex; or c) different 
subunits and active surfaces. This method is also 
suitable where more than one subunit contains a portion of 
an active surface; the surface required for activation is 
too large to be occupied by a single small organic 
molecule present within a CHEMFILE; and activation of 
oligomeric receptors is intimately associated with the 
hormone induced formation of complexes of at least two 
receptor subunits (Cunningham, 1991; Kelly, 1991; DeVos, 
1992; and Wells, 1993). 
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Unlike standard screens to identify a single chemical 
entity to replace a large multi-site binding hormone, the 
approach described according to this invention, identifies 
pairs of active surfaces, finds SOMERs for each individual 
active surface, and then links the SOMERS together to 
create multimeric units (e.g. ,DIS0MER) large enough to 
replace the multivalent hormone, e.g., growth hormone 
(GH) . In the example provided, the target oligomeric 
receptor is the homo-dimeric growth hormone receptor (GHR) 
and the active surfaces identified are the two surfaces 
used by GH for active GHR dimerization. For GHR there is 
only one type of receptor subunit, referred to here as Tl. 
Activation of the receptor requires GH to dimerize two 
receptor subunits (Tl 2 ) by maintaining binding of active 
surfaces on two Tl. 

1. Identification and Isolation of rVabs 
Specific for GHR 

Step la: Identification of GHRT+rVab. lib 
for the Tl GHR Subunits 

Isolate from the rVab". lib the subpopulation which 
binds to the surfaces of the Tl GHR subunit. These rVabs 
are designated GHR . T+rVab . 1 ib . 

Library surface scanners are provided by the rVab.lib 
constructed as outlined in Example 1 of this invention. 
This rVab.lib, i.e., rVHCH:VLCL complexes, "is expressed on 
phage surfaces attached to the phage gpIII coat protein. 
A one ml aliquot of phage lysate (>10 l2 t.u.) is mixed with 
GHR receptor subunits (Tl) which are prebound to an 
immolized solid support i.e., agarose bead- type isolation 
matrix (mat-Tl) . In this example, the basic GHR subunit 
(Tl) used is that which encompasses only the excellular 
domain of the hGHR, including hGHR amino acids 1 to 238 
(Leung, 1987; Fuh, 1990) with an unpaired penultimate 
cysteine (Bass, Greene et al . 1990). This form is 
referred to as sGHR and is expressed in E. coli as an 
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extracellularly released soluble protein (Fuh, Mulkerrin 
et al. 1990). This soluble protein is then purified {Fuh, 
Mulkerrin et al. 1990) and bound to beads or plastic 
through its unpaired cysteine (Bass, Greene et al. 1990), 
or to plastic through an antibody which recognizes the 
sGHR but does not interfere with GH binding or active GHR 
dimerization (Fuh, Mulkerrin et al . 1990; Cunningham, 
Ultsch et al. 1991). All forms of sGHR bind GH as does 
the endogenous membrane associated entact GHR (Leung, 
1987; Fuh, 1990) . An excess of soluble prolactin receptor 
(PRLR) as competing peptide (comp-T-peptide) (see Fig. 16) 
or various mutant hGHR, or PRLR missing either H binding 
site I or II (Cunningham, 1991; DeVos, 1992; and 
Rozakis-Adcock, 1992) to compete binding of non-specific 
rVab binders which have no selectivity for GHR binding is 
routinely added to the mixture to define rVab specificity. 
With sGHR attached to 0.2 mg of oxivane polyacrylamide 
beads (Sigma) the reaction mixtures can be as small as 50 
ul beads. The excess of soluble prolactin receptor 
competes for binding of non-specific rVab binders which 
have no selectivity for GHR binding. The mixture is 
incubated for at least 3 hr at 30°C in buffer A which 
supports normal GHg and GHR association with one entity 
displayed as an attached phage coat protein (Bass, Greene 
et al. 1990) and consists of <50 mM Tris, pH 7.4, 1 mN 
EDTA 50 mM NaCl , 1 mg/ml BSA dnd 0.02% Tween 20 and washed 
three (3) times in 3 0C buffer A. The rVab bound to the 
matrix associated GHR, in the presence of the excess 
competing soluble non-GHR related peptide (i.e., the 
comp-T-pep) is designated the GHRTS+ rVab.lib. The phage 
30 are recovered by washing (2x) either in Buffer A with 20 
nM hGH or 0.2M glycine (pH2.1) (Bass, Greene et al . 199 0) 
and tittered. 

The phage libraries are mixed with E . coli (at a 
multiplicity of infection) of approximately one (l) , 
incubated without shaking for 30 min and then plated in 
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antibiotic media and grown overnight and tittered. The 
survivors are pooled and grown overnight and frozen as 
bacterial stocks, in 15% glycerol. An aliquot of the 
stock is grown up and new phage ly sates are made and 
tittered. This phage population, GHR.TS+rVab recognizes 
all surfaces on the Tl subunit of GHR. Definition of S+ 
in this population at this time is not mandatory, and can 
be omitted, i.e., by not adding prolactin receptor (or any 
other comp-T-pep) to the original reaction mixture above, 
if the number of GHR.TS+rVab members obtained in Step 1 
which are competed by GH (see below) is less than 100. 

An additional phase of V region amino acid 
diversification within CSRs and/or CDRH3, as per outlined 
in the Example 1 and summarized in Fig. 15, is performed 
if greater numbers of GHR.T+ or TS+rVab are desired. 

Step lb: Subdivision of TS+rVab based 
on GHR surface epitope recognized 



20 



lb) Group library members according to common 
receptor surfaces recognized. Designate groups as 
GHR(x-y) .T+rVab.lib, where x-y is the amino acid domain of 
the Tl unit containing the common group epitope (Fig. 16). 

Separation according to the receptor surface 
recognized is accomplished by adding aliquots of TS+rVab 
to plastic dishes to which have been preabsorbed peptides 
25 (obtained commercially) of 10-20 amino acid overlapping 

amino acid sequences of GHR and those domains containing 
amino acid sequences known to influence GH binding (i.e., 
hGHR amino acids 54-68, 171-185, 9 [GHR sitel] ) ; and 
116-119 and 8- 14 (GHRsitell) as described (Cunningham, 
3q Henner et al . 1990). TS+rVab are incubated with 

preadsorbed peptides in buffer A (20 mM TrisHCl buffer pH 
7.5, 1 mM EDTA, 0.1 % bovine serum albumen) for 3 hr at 
30°C. The dishes are washed to remove unbound rVabs . 
Bound rVabs are released from the matrix, tittered and 
amplified again via infection in E. coli . Binding to 
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these overlapping GHR peptides produces a grouping 
according to primary receptor amino acid sequence and 
hormone binding. Each of the separate groups are then 
mixed with soluble matrix- GHR (see step la) in buffer A 
with greater than 100 fold excess GH and incubated 3 hr at 
30°C and centrifuged. The phage in the supernatant are 
tittered, amplified and further enriched by panning 2-3 
additional times for TS+rVabs which do not bind to GHR in 
the presence of GH. This recycling produces a population 
of GHR.TS+rVab which bind to a surface of the GHR which is 
normally occupied by bound GH. Although these steps do 
not identify and/or subdivide all GHR hormone related 
epitopes, they divide the original GHRTS + rVab . 1 ib into 
workable sized subgroups based on binding to various amino 
acid sequences and domains involved in GHR recognition. 
Each group is tittered, amplified, infected into E . coli 
and bacterial stocks and subsequent new phage lysates are 
prepared. Each group is designated by its amino acid 
receptor sequence or domain recognized (e.g., amino acid 
x-y) as follows: GHR.T (x-y) S+rVab. lib. Competition by 
these rVab for I125hGH binding to sGHR is done in standard 
binding assays (Spencer, Hammonds et al . 1988) in buffer A 
with terminated by precipitation by polyethylene glycol 
8000, at 4°C in phosphate buffered saline as described 
(Leung, 1987) . Competition binding to membrane associated 
GHR is performed under identical conditions and reactions 
are terminated by filtration and washing. 

2. Formation and Identification of 

Bifunctional Active rVabs Possessing 
Random Sequences of Amino Acid 

Step 2?: Preparation And Expressio n Of rVab-Pep Libr-aT-y 

2a) Attach a random 8 amino acid peptide library 
(Pep8) in frame to the light chain (VLCL) of all members 
of a rVab library recognizing a common GHR surface (Fig. 
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11 and 20) . Designate these bifunctional surface binder 
libraries GHR(xy) .T+rVab-pep.lib. 

Each of the group libraries is genetically engineered 
to be expressed, in a coupled manner, with a short random 
peptide of 8 amino acids (pep 8) attached through a short 
linker (LNKR) to one chain of the rVab (Fig. 11) . 
Attachment can be at different positions on different 
chains depending upon which Cre-Lox recombination system 
is used to combine the rVHCHl.lib and rVLCL.lib onto the 
same piece of DNA when the rVab.lib is made (see Fig. 11 
vs. 13) . In this example, the rVab.lib is made according 
to Example l (Fig. 11) and attachment of the pep 8 is to 
the amino terminus of the VL region of the rVLCL.lib (Fig. 
11 and 20) . In Example 4 below, the construction of a 
different rVab.lib where addition to a single pep8 could 
15 be made to either the carboxy terminus of the constant 

domain (CD of the rVLCL or to the aminoterminus domain of 
the VH of the rVHCHl is described (also see Fig. 13) . 

In this example, attachment is accomplished by using 
PCR to append the pep8 library to the 5' end of the VL 
20 region within the rVLCL members of the GHR . TS +rVab . 1 ib . 

This reaction uses forward primer CH209-216-NotlFWD and 
back primer APAPEP8LNKRBCK (i.e., leader 

seq.Apal- (NNN) g (GGGGS) ,VLl-7) (see Primer Table, Fig. 10). 
These reactions contain am aliquot of bacteria from each 

25 GHRT(x-y)S+rVab.lib. , Taq polymerase and forward and back 
primers and are cycled 25 times (94°C 1 min, 60°C for 1 
min and 72 °C for 2 min) . The amplified, appended DNA is 
purified using Magic PCR PREPS (Promega) and after 
suspension in water, 1 /xg of the purified DNA is digested 

30 with Notl and Apal and ligated using 1200U T4 ligase 

(Sambrook, Fritsch et al . 1990) to f drVabpCARRIER (see 
Fig. 11) precut with Notl and Apal. The ligated product, 
designated f drVabPEPpCARRIER is isolated with GeneClean 
and elect roporated into E. coli . Trans fonnants are grown, 

35 tittered and frozen stocks are made. A sufficient number 
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of colonies are picked and sequenced to confirm the 

presence of the random pep8 library. The bacteria, 

designated GHRT(x-y) .rVab-PEP. lib.bact are then grown and 

phage are induced for expression with helper phage so that 

the GHRT(x-y) rVab-pep constructs are displayed on the 

phage surface attached to gpIII. (see Fig. 20). 

With the amino acids of the octapeptide being random 

at each position, there are greater than 10 ,D peptide 

combinations for each library. Accordingly, with less 

than about 100 GHR.TS+rVab in each group the combinatorial 

rVab-pep.lib number is less than 10 l2 and is therefore 

accommodated in a normal phage lysate. If the number of 

GHR.TS+ rVab is greater than about 100, the random 

octapeptide library is expressed alone as a fusion protein 

fused to the gpIII on the surface of fd phage via the same 

linker ([GGGGSJ2) and the octapeptides which recognize GHR 

surfaces are isolated first by panning over matrix 

attached GHR complexes. Those phage which stick to the 

matrix, are isolated, amplified and the oligonucleotide 

sublibrary encoding the pep8 octapeptides which bind to 

GHR are excised and amplified with primers containing a 

leader restriction site (in the BCK primer) and ApaLl (in 

the FWD primer) . This smaller pep8 oligonucleotide 

sublibrary, which is T+ (pepT+) , is then ligated into the 

grouped GHR.TS+ rVab. lib precut at the rsl site in the VL 

Lgplll leader sequence and at Apa 1. (See Fig. 11D) to 

produce a GHR.TS+rVab-pep (T+) library. In such cases the 

members of this combinatorial library, less than 10 u , are 

grown, the phage induced and the library of surface 

attached GHR.TS+rVab-pep(T+) harvested and tittered. 

Step 2b; Identif ication of Active Bivalent rVab-P^ >p 

Memfrers 

2b) Isolate GHR (x-y) T+ . rVab-pep members which 
actively (A+) dimerize the receptor as does GH. Label 
these GHR (x-y) TA+rVab-pep. 
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The bivalent rVab-PEP, are expressed as a phage 
displayed library and are panned for combinatorial members 
which actively dimerize GHR. The positives are labeled 
GHR.rVabT(x-y)SA+-pepTl+.lib. In this step, activation is 
recognized by the occurrence of one or more of the 
5 following observable events: 1) dimerization of two GHR 
Tl subunits; 2) dimerization of two Tl subunits which 
allow fluorescence transfer between the same or different 
modified amino acids in the two subunits as described by 
Cunningham (Cunningham, Ultsch et al . 1991); 3) dimmer 
'0 formation which generates an antibody recognized epitope 
which contains amino acids from two Tl subunits which 
occur only in activated dimeric Tl 2 structures (Taga, 
Narazaki et al . 1992); 3) GHR -GH- GHR -matrix complexes 
which are dissociated by wild type hGH, or only a mutant 
15 hGH with only site I or site II binding capability 
(Cunningham, Ultsch et al - 1991); or 4) antibody 
recognizable phosphorylation of one of the receptor 
subunits associated with active receptor dimerization. In 
the later case, incubation of GHR.rVabT(x-y)S+pep.lib with 
20 and PKC is carried out before panning and the ATP and 

PKC is present during the panning procedures. It is also 
possible to monitor for in vitro active dimerization by 
the co-presence of some third GHR associated protein in 
the active complex (Taga, Narazaki et al. 1992). 
25 2c) Confirm activity by testing for activation of a 

cell associated GHR. Those GHR . TS A+ rVab - pepT which appear 
active in vitro , are tested in an intact cell assay system 
such as GH induced growth of myeloid leukemia cell line 
FDCP1 expressing hybrid extracellular domain 
30 GHR- intracellular granulocyte colony- stimulating factor 
receptor (GCSFR) (Fuh, Cunningham et al . 1992) or IM-9 
cells (Silva, Weber et al. 1993) to confirm the agonist 
nature of the rVab-pep complex. 
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3. 



Identification of Active GH-rVab Pairs 
For Use As Reporters 
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Step 3ft- Expression of Sol uble rVabs 

3a) Identify from among the members of different A+ 
rVabA+-pep groups, those which have a rVab which by itself 
competes with the peptide member of the same or different 
rVabA+-pep group. This is accomplished by carrying out 
competition binding assays designed to identify those 
rVabs and peptides which compete with each other for 
binding to the GHR . The peptide portion of an active 
rVab-pep is separately expressed without the corresponding 
rVab to perform these binding assays. By this process 
rVabs which can mimic and replace the pep8 portion of an 
active rVab-pep member are identified. The rVab of a 
first A+rVab-pep member and the rVab of a second 
A+rVab-pep member which competes with the peptide portion 
of the first member, are designated an active pair of 
GH-rVabs. 

Specif ically, after confirmation of activation is 
obtained, the active rVab-pep are modified by appropriate 
digestion of the construct to allow expression of soluble 
rVab without any linkage to phage coat protein gpIII and 
to the octapeptide as. well. Such simplified entities are 
labeled rVabTS+A*. To prepare the modified constructs 
allowing for expression of free soluble rVab, DNA from 
rVab-pep is obtained, digested with Apal and Notl and 
isolated. One /*g of the isolated DNA is then ligated with 
5 /ig pEXPRESSIONrVab DNA precut with ApaLl and Notl by 
incubation with T4 ligagse. The ligated products are 
isolated by GeneClean II and electroporated into E . coli 
and transf ormants obtained and confirmed by diagnostic PCR 
and sequencing. Frozen stocks are prepared. These stocks 
are denoted GHR . rVabTS+A* and not A+ because by themselves 
they cannot activate the GHR but are members of active 
pairs (i.e., rVabs and pep8s) which do activate the 
receptor. Expression of the octapeptide member of the 



WO 96/04557 PCT/OS9S/101M 



- 143 * 

o ... 

active rVab-pep is carried out by excision and ligation of 
the oliognucleotide portions encoding the pep8 and 
transfer to expression vectors in which the pep8 is 
expressed as a soluble extracellular entity fused with a 
easily purifiable tagged carrier protein (using a variety 
^ of commercially available expression vectors) or attached 
via GGGGS linker to gpIII coat protein and displayed as a 
phage surface entity. These entities are labelled pep8A* 
and are used as described below to identify rVab for the 
other portion of the GHR active surface utilized by the 
10 active rVab-pep entity. 

3b) rVab and pep8 members of active pairs are 
grouped according to common GHR surfaces recognized (as 
described above) . 

4. Preparation of G H-rVab- Reporters 
15 Convert a rVab representative of at least one active 

pair of GH-rVabs into a GH. rVab- Reporter. 

The CH domain of the heavy chain of the rVab is 
labelled (as described in Example 2) and the labelled 
entity, designated GH.rVab- REPORTER, is used to establish 
20 saturation and competition binding assays as described in 
Example 2 . 

The isolated and expressed separated pep 8 members 
from active rVabA+-pep constructs are used in standard 
binding competition assays to identify (see Fig. 11) those 

25 GHRrVabT+ which bind to the same GHR domain as the pep 8 

entities. Those which compete are designated as the second 
member of the active pair of rVab for the two active GHR 
surfaces required for receptor activation. This second 
member is then converted to a rVab-Reporter (see above) . 

30 The rVab member of the rVabA+-pep construct from which the 
pep8 was obtained is the second member of the active pair. 
Step 5: SQMER SCREENING 

Establish binding assays with each member of an 
active pair of GH. rVab- REPORTERS for a pair of SOMERS, 

35 each capable of binding to at least one of the two domains 
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of an active pair of receptor surfaces involved in active 
GHR dirnerization. 

The GH.rVab- REPORTER is used under standardized and 
automated binding assay conditions to identify SOMERs 
within a chemical data base (i.e., CHEMFILE) which will 
compete at an active* (A*) surface on the Tl subunit of 
the GH receptor. These SOMERs are designated S0MER-T1. In 
a parallel fashion, using the other rVab-Reporter member 
of the active rVab pair (as defined above) SOMERs are 
isolated for the second active surface on GHR required for 
its activation (Figs. 21 and 22). The SOMERs which 
recognize the second site are designated SOMER-T1. 

Identification of specific interaction with site I 
(i.e., Tl) or site II (i.e., Tl') of huGHR is made in 
binding assays measuring the ability of these entities to 
compete with mutant 1251 -GH which can only bind to site I 
or II as described (Cunningham, Ultsch et al . 1991). 
Step 6; PISOMER Preparation And Identification of Drug 

In the last step of this process, SOMER-T1 and 
S0MER-T1' are covalently combined to create a bivalent 
SOMER (i.e., a DISOMER) which can recognize the two sites 
of the active surface pair, i.e., the Tl and Tl' receptor 
dimmer subunit active surfaces. This DISOMER can actively 
dimerize the GH receptor subunits as does the native 
hormone. Confirmation of DISOMER GH activity is obtained 
in standard radioreceptor binding assays (competitive with 
intact labelled GH) for GHR binding and standard activity 
assays (in vj.tiTQ and/or GHR cellular activation systems) . 
Additional assay systems for active hormone receptor 
subunit oligomerizations in which a free excellular 
receptor .-hormone complex associates with other membrane 
proteins in intact cells to form active oligomeric 
complexes which direct auto-, and substrate 
phosphorylation, and other down stream activation 
responses (Taga, Narazaki et al . 1992). 



WO 96/04557 



PCT/US95/10182 



- 145 - 

o 

Steps 1-4 of the process, which find active surface 
landscapes involved in active dimerization of two TI 
subunits of GHR are outlined in Figs. 20, 21 and 22. Fig. 
20 is a flow diagram for creation of rVab-pep. libraries 
and isolation of rVab- peptides for the two active GHR 
5 surfaces. In the example presented here of oligomeric 

receptor targets, there is only one type of subunit (Tl) 
in the active GHR dimer complex, and therefore subunit T2 
- Tl. Fig. 21,22 illustrate GHT1- and GHTl'-SOMER and 
GH-DISOMER (i.e., GHT1-GHT1' ) identification. 
10 EXAMPLE 4 

Example 4 is a variation of Example 3 which 
recognizes the fact that many hormonal receptors are 
comprised of different receptor subunits. Often at least 
two or three subunits which may all be different from each 
15 other are required for activity. In these cases, hormone 
induced receptor oligomerization associated with receptor 
activation, requires interaction of the hormone with at 
least three active surfaces, each being on a different 
receptor subunit. Examples of heterodimeric (alpha/beta, 

20 or alpha/gamma) receptors include the group of interleukin 
(ID IL3, IL4, IL5, IL7, IL9 receptors and the GMCSF receptor, 
and the group of growth factor FGF, PDGF, CSF and NGF 
receptors, while an example of a heterotrimeric receptors 
( alpha, beta and gamma) is the IL2 receptor (see reviews 

25 Pierce, 1989; Boulay , 1993 ; Cosman, 1993; Kishimoto, 1994; 
Kaushansky, 1993; Kondo, 1994; Noguchi, 1993; Russell, 
1993 and Bamborough, 1994) . 

The use of rVab to identify active surfaces involving 
two or more sites distributed on multiple subunits 

30 involves certain adaptions from the process used when 
activation requires only one site. First, with the 
heterooligomeric receptors, a different rVabT(x)S+ lib is 
identified for each subunit (x) using the soluble receptor 
subunits as initial targets (e.g. Tavernier, 1991), as 

35 Second, that for trimeric receptors two random peptide 8 
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libraries are attached to each rVabT(x)S+ library. Third, 
where the rVab is T+ for the alpha receptor subunit (i.e., 
rVabT+S) , the other two members of the active trio (i.e., 
those binding to each of the other two subunit surfaces 
necessary for active receptor trimerization) , designated 
rVabT+ and rVabT+, are identified as those which compete 
for binding with one of the two octapeptide members of an 
active rVabTSA+-pep 2 . For such trimeric receptors, the 
individual rVHCH.lib and rVLCL.lib made in Example 1 are 
combined into different fdRECEIVERs and pUCl 9 PROVIDERS as 
detailed in Fig. 13. 

In this application, rVHCH.lib is placed into a 
fdRECEIVER which allows expression of rVHCH fused to gpIII 
coat protein and with, or without, peptide (preferably 8 
amino acids) attached to its aminoterminus . The rVLCL.lib 
is placed into a pUCPROVIDER which allows for expression 
of rVLCL as soluble entities with, or without, peptide, 
preferably 8 amino acids, attached to its CL domain. 
After vivQ Cre-Lox -recombination of these two 
libraries, as detailed in Example 1, (see also. Fig. 13) 
the product rVab.lib is cloned as a single fdDNA 
designated f drVabPEPCARRIER . rVab members which bind to 
each of the receptor subunits (i.e., Tx+rVab) are then 
isolated and grouped as described in Example 3 . 
Subsequent addition of one or two random octapeptide 
libraries (Pep8°) , which in some cases have been 
prescreened and selected for binding to an identified 
receptor subunit is accomplished via PCR. As described 
above and in Fig. 13, oligonucleotides encoding the 
peptides are added to the DNA encoding the rVab library 
using FWD primer CLLNKPEPFWD 

(Ascl- (NNN)8(GGGGS)3CLL208-216) and VHLNKPE PBCK 
(rsPELB- (NNN) 8 (GGGGS) VH1- 8) together or in combination 
with primers having no Pep8 or linker- appending 
sequences. Use of one of these primers with a primer 
devoid of a Pep8 library could be used to generate a rVab 
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with one attached pep8 (i.e., rVab-Pep8 ! ) as described 
above in Example 2 with the single Pep 8 library appended 
through linker to the either the aminoterminus of the 
rVHCHl member or the carboxyterminus of the rVLCL member 
(Fig. 13) . 

5 According to this process, each attached peptide and 

the rVab portion of the rVab-PEP 2 each bind to a specific 
target site. Binding to all three sites is required for 
activity of the receptor. Therefore, the trimeric 
rVab-PEP 2 unit defines three binding domains: one defined 

10 by the rVab portion ((T(x)), and one each by each of the 
pep8 (i.e., pep8* and pep8 2 ) present in the construct. 

Isolation of active rVab- Pep 2 members utilizes 
enrichment cycles in which all three receptor units are 
complexed together in active trimeric structures. Such 

15 structures, complexed with their phage expressed rVab-Pep 2 
entities, are enriched by use of matrix-bound active 
subunits, antibodies to each of the three units, 
antibodies to modifications of receptor units which occur 
upon active oligomerization, such as phosphorylation or 

20 association with additional non- receptor membrane 
components (Argetsinger , Campbell et al . 1993; 
Silvennoinen, Witthuhn et al . 1993; and Witthuhn, 1993). 
Confirmation of agonist and antagonist activity is done 
using standard hormone : receptor binding assays to 

25 establish competitive binding of hormone to its receptor 
(Kitamura, Sato et al . 1991; Imler and Zurawski 1992; 
Pietzho, Zohlnhofer et al . 1993) and cellular receptor 
dependent activity assays measuring growth, DNA synthesis, 
protein phosphorylation etc. (Yokota, Otsuki et al . 1986; 

30 Pierce, Ruggiero et al . 1988; Solari et al. 1989; 

Anklesaria, Teixido et al . 1990; Heidaran, Pierce et al . 
1990; Pierce, Di et al. 1990; Heidaran, Pierce et al . 
1991; Keegan, Pierce et al . 1991; Murakami, Narazaki et 
al. 1991; Kruse, Tony, et al. 1992; Otani, Siegel et al . 
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1992; Taga, Narazaki et al. 1992; and Wang, Ogorochi et 
al. 1992) 

For a given rVab-Pep8 2 , we identify rVabA*s which are 
contained in other rVab-Pep8 2 A + which bind to each of the 
target sites bound by the peptides of the original active 
rVab-Pep8 2 (trimer rVabA* unit), following the same 
process outlined in Example 3. Using this process, members 
of the trimeric unit are identified as a) any rVabT+ from 
another active construct (i.e., rVabTSA+- Pep8 2 ) which 
competes with one of the two PEP lib on the original 
active rVabTSA+ - PEP 2 , b) with any rVabT+ from a third, 
active construct (i.e., rVabTSA+-Pep8 2 ) which competes 
with the other PEP on the original active rVabTSA+'PEP 2 
and c) the rVabTS+ of the original active rVabTS+ - PEP 2 . 
Competition for binding to GHR is determined by assaying 
for competition of PEP8 units expressed either attached 
to gplll coat protein and presented as phage displayed 
entities or as soluble fusion proteins with labelled 
rVabTx+- Reporters which are made as described above in 
example 2 and 3. After identification of all three of the 
active trimer members, each rVab member of the active 
timeric unit is then cloned minus its Pep8 library 
member (s), expressed, isolated and converted to a 
rVab-REPORTER , as detailed in Example 1, and used to 
establish competitive binding assays which .then find 
competing SOMERs (i.e., Somer-T,, T r or T 3 ) . In the final 
stage covalent linking of the three- Somer-Ts) is done so 
as to construct the active multimer, in this case a 
TRISOMER (i.e., T,-T a -T,) f substitute for the native 
hormone. In these systems, an additional receptor 
activation assay system is available for heterooligomeric 
receptor activation which monitors the induction of 
identifiable holoreceptor induced cellular responses by 
preformed soluble complexes of hormone and one of the 
receptor subunits in response to the binding of these 
complexes to intact cells expressing the other subunit(s) 
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of the active receptor complex and formation of active 
holoreceptor complexes (Taga, Hibi et al. 1989). 

In these systems, an additional receptor activation 
assay system may be used to confirm heterooligomeric 
receptor activation. Such systems monitor the induction 
of identifiable cellular responses induced by the 
combination of preformed soluble complexes comprising 
hormone and one of the receptor subunits and intact cells 
expressing the other subunit(s) of the active receptor 
complex and the subsequent formation of active complete 
holoreceptor complexes (Taga, Hibi et al. 1989). 

The following Table lists exemplary ligands and 
heterooligomeric receptor systems for which this invention 
provides a means for identifying their pharmacologic 
target sites as well as SOMERS or DISOMERS. 

Interleukinl Immune System Supression/Stimulation Agonist/Antag. 

11*2-7, 9-11 Immune System Supression/Stimulation Agonist/Antag. 

Insulin Like 
Growth Factors: 



TGFbetas 

GCSF 
CSF 



Erythropoietin 

GMCSF 

PDGF 
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EGF 
FGF 

NGF 

Small Organic 
Molecules 



Neoplasias 
Erythr opoi esis 

Granulopoiesis 

Wound Healing (Matrix proteins) 
Inflammation 
Carciogenesis 
Autoimmune Disease 
Chemotherapy 
Bone Marrow Transplation 
Bone Marrow Failure Syndromes 
( re : radiation/chemotherapy) 
Inflammatory 

Neoplasms (acute myeloid leukemia) 

Hematopoiesis (anemias) 

Immune Suppression/Stimulation 

Wound Repair 

Angiogenesis 

Vasoconstriction 

Atherosclerosis 

Neoplasms 

Pulmonary Fibrosis 

Inflammatory Joint Diseases 

Wound Repair 

Neoplasms 

Neoplasms 

Wound Repair 

Angiogenesis (Capillary Blood 

Vessels) 
AntiNeurodegenerative Diseases 
(Acute/Chronic) ; (Peripheral /Central) 

Neurotransmitters 

i.e. Cholinomimetics (ACh © 



Antagonist 
Agonist (synergi 

w Epo) 
Agonist (synergi 

w GMSCF) 
Agonist 
Antagonist 
Antagonist 
Antagonist 
Agonist 
Agonist 

Agonist 

Antagonist 

Antagonist 

Agonist 

Agonist/Antag. 

Agonist 

Antagonist 

Antagonist 

Antagonist 

Antagonist 

Antagonist 

Antagonist 

Agonist 

Antagonist 

Antagonist 

Agonist 

Antagonist 
Agonist 
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mReceptor 1-5) Agonist /Ant ag 

Transporter/Channel Regulators Agonist/Antag 
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I CLAIM: 

1. A method of identifying a ligand capable of 
binding to at least one determinant of a biologically 
active site on a target, which determinant participates in 
conferring biological activity of said target, the method 
^ comprising : 

a) providing at least one reporter 
antibody to be used as a reporter of binding of said 
ligand to the biologically active site, and wherein said 
antibody is selected from an antibody library of 

*0 sufficient diversity to possess at least one antibody 

member capable of binding to at least one determinant in 
the biologically active site as determined by the ability 
of said antibody member, either alone or in combination 
with at least one other ligand, to possess agonist or 

15 antagonist activity; 

b) identifying as potential ligands for 
activity at the target, those ligands which are capable of 
competing with the reporter antibody for binding to the 
target . 

20 2. The method according to claim 1 wherein the 

reporter antibodies are members of a recombinant library 
wherein each antibody member (rVab) of the recombinant 
library comprises at least one variable region selected 
from the group consisting of VH and VL regions, and 

25 optionally comprising a constant domain attached by its 
amino terminus to the variable region, 

3 . The method according to claim 2 wherein the 
rVab unit is displayed on the surface of a carrier. 

4 . The method according to claim 2 wherein the 
30 rVab unit is soluble. 

5 . The method according to claim 3 wherein the 
carrier is a bacteria. 

6 . The method according to claim 3 wherein the 
carrier is a bacteriophage. 

35 7. The method according to claim 2 wherein a 

parental VL region comprising at least one CDR is used to 
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derive the VL region of the rVab by deleting, inserting or 
substituting at least one amino acid within at least one 
CDR. 

8. The method according to claim 2 wherein a 
parental VH region comprising at least one CDR is used to 

5 derive the VH region of the rVab by deleting, inserting or 
substituting at least one amino acid within at least one 
CDR . 

9. The method according to claim 2 wherein 
parental VL and VH regions comprising at least one CDR, 

10 are used to derive a pair of VL and VH regions of a rVab 
by deleting, inserting or substituting at least one amino 
acid within at least one CDR of each variable region. 

10. The method according to any one of claims 
7, 8 or 9 wherein the crystal structure of the parental V 
regions used to derive rVab are known. 

11. The method according to claim 9 wherein the 
crystal structure of the parental VH and VL pair used to 
derive the rVab is known. 

12. The method according to claim 2 wherein at 
20 least one of the parental V regions used to derive rVab is 

unmodified. 

13 . The method according to claim 2 wherein the 
crystal structure of the rVab is determined after 
isolation as a rVab which binds to a biologically active 
site on the target . 

14 . The method according to claim 2 wherein at 
least two V regions are modified by deleting, inserting or 
substituting at least one amino acid in at least one CDR 
after isolation as rVab which binds to a biologically 
active site on the target . 

15. The method according to claim 1 wherein the 
target is a polypeptide, protein, nucleic acid, 
oligosaccharide, carbohydrate or lipid. 

16 . The method according to claim 1 wherein 
activity of the target is coupled to an assayable 
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° biochemical response at the target which biochemical 
response functions as a signal of target activation. 
'7 17. The method according to claim 16 wherein 

the biochemical response is detectable as a change in a 
protein or polypeptide characteristic. 
5 18. The method according to claim 16 wherein 

the biochemical response is associated with an 
organometallic moiety, a metal or other nonprotein. 

19. The method according to claim 16 wherein 
the biochemical response is associated with a portion of 

10 the bioactive structure. 

20. The method according to claim 16 wherein 
the biochemical response comprises a detectable free 
radical, fluorescent or chemiluminsecent group, 
radioactive isotope or involves oligomerization. 

15 21. The method according to claim 16 wherein 

the biochemical response is phosphorylation and the signal 
is a change in the phosphorylation state of the target. 

22. The method according to claim 17 wherein 
the signal protein is a G protein and the signal is a 

20 change in either the prepense of a G protein regulatory 
agent or the binding of rVab due to the presence of a G 
protein regulatory agent. 

23. The method according to claim 16 wherein 
the signal is a change in the binding of rVab to its 

25 binding site. 

24. The method according to claim 2 wherein the 
recombinant antibody comprises a single polypeptide chain 
comprising a VH functionally coupled to a VL to produce a 
binding site. 

30 25. A method of identifying ligands capable of 

binding to at least two determinants which together are 
required for biological activity of a pharmacological 
target, the method comprising: 

a) screening and isolating from an rVab 

35 library, rVab members comprising at least one VH and VL 

regions, and optionally comprising a constant domain 
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attached by its amino terminus to the V region, and 
capable of binding to at least one of the determinants of 
the pharmacological target; 

b) making and expressing an rVab-peptide 
(rVab-PEP) library comprising the isolated rVab members 
coupled to at least one peptide comprised of a random 
sequence of amino acids; 

c) screening the rVab-PEP library for 
first rVab-Pep members which bind and activate the 
pharmacological target wherein the rVab component binds to 
a first determinant of the pharmacological target and the 
peptide component binds to a second determinant of the 
pharmacological target ; 

d) screening the rVab-Pep library and 
identifying a second rVab-pep member capable of actively 
binding to the pharmacological target, and wherein the 
rVab component binds to a third determinant of the 
pharmacological target and the peptide component binds to 
fourth determinant of the pharmacological target. 

26. The method according to claim 25 wherein 
the rVab component of the second rVab-Pep member competes 
with the peptide component of the first rVab-Pep member 
for binding to a determinant on the pharmacological 
target . 

27. The method according to claim 25 wherein 
the rVab component of the first rVab-Pep member competes 
with the peptide component of the second rVab-Pep member 
for binding to a determinant on the pharmacological 
target . 

28. The method according to claim 25 wherein 

30 the first determinant of the pharmacological target is the 
same as the fourth determinant, and wherein the second 
determinant of the pharmacological target is the same as 
the third determinant . 

29. The method according to claim 25 wherein 
35 the rVab component used to construct the rVab-Pep has at 

least one other attribute of an active ligand, besides 
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affinity for the target, and wherein the attribute is 
selected from selectivity and biological activity. 

30. The method according to claim 29 wherein 
rVabs which bind to determinants of active sites are 
identified by their ability to competitively or 

3 allosterically alter the binding on an endogenous ligand. 

31. The method according to claim 25 wherein 
the active rVab-Pep possess agonist or antagonist 
activity. 

32. The method according to claim 31 wherein 
10 activity of the target is coupled to an assayable 

biochemical response at the target which biochemical 
response functions as a signal of target activation. 

33. The method according to claim 32 wherein 
the biochemical response is detectable as a change in a 

15 protein or polypeptide characteristic. 

34 . The method according to claim 32 wherein 
the biochemical response is associated with an 
organometallic moiety, a metal or other nonprotein. 

35. The method according to claim 32 wherein 
20 the biochemical response is associated with a portion of 

the bioactive structure. 

36. The method according to claim 32 wherein 
the biochemical response comprises a detectable free 
radical, fluorescent or chemiluminsecent group, 

25 radioactive isotope or involves oligomerization. 

37. The method according to claim 32 wherein 
the biochemical response is phosphorylation and the signal 
is a change in the phosphorylation state of the target. 

38. The method according to claim 33 wherein 
30 the signal protein is a G protein and the signal is a 

change in either the prepense of a G protein regulatory 
agent or the binding of rVab due to the presence of a G 
protein regulatory agent. 

39. The method according to claim 32 wherein 
35 the signal is a change in the binding of rVab to its 

binding site . 
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40. The method according to claims 25 wherein 
the peptide component of the rVab-Pep members comprising 
VH and CL regions are expressed attached to either or both 
of the amino terminus of VH and the carboxy terminus of 

41. The method according claim 40 wherein the 
peptide component is attached to the amino terminus of the 
VH region. 

42. The method according to claim 4 0 wherein 
the peptide component is attached to the carboxy terminus 
of the CL region. 

43. The method according to claim 40 wherein 
two peptides are attached to the rVab component to form 
rVab-Pep 2 . 

44. The method according to claim 40 wherein the 
peptide comprises between about 5 and 50 amino acids. 

45. The method according to claim 44 wherein 
the peptide comprises between about 7 and 25 amino acids. 

46. The method according to claim 45 wherein 
the peptide comprises about 8 amino acids. 

47. A reporter of binding of a ligand to a 
determinant of a pharmacological target, which target 
requires binding of ligand to at least two determinants of 
said target to produce a biological response, said 
reporter comprising an rVab portion of an active rVab-Pep, 
and wherein said rVab component of said rVab-Pep binds to 
a first determinant of said target,, and the peptide 
component binds to a. second determinant of said target. 

48. The reporter of claim 4 7 wherein tne rVab 
comprises VH and CL regions and the peptide is expressed 
bound to either or both of the amino terminus of the VH 
and the carboxy terminus of the CL. 

45- The reporter according claim 4 8 wherein the 
peptide component is attached to the amino terminus of the 
VH region. 
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50. The reporter according to claim 4 7 wherein 
the peptide component is attached to the carboxy terminus 
of the CL region. 

51. The method according to claim 47 wherein 
two peptides are attached to the rVab component to form 
rVab-Pep 2 . 

52 . The method according to claim 47 wherein the 
peptide comprises between about 5 and 50 amino acids. 

53. The method according to claim 52 wherein 
the peptide comprises between about 7 and 25 amino acids. 

54 . The method according to claim 53 wherein 
the peptide comprises about 8 amino acids. 

55. A method of identifying a ligand capable of 
binding to at least one determinant of a biologically 
active site on a target, which target requires activation 
of at least two determinants to express biological 
activity of said target, the method comprising: 

a) providing at least one rVab reporter 
antibody according to claim 47 to be used as a reporter of 
binding of said ligand to the biologically active site, 
and wherein said antibody is selected from an antibody 
library of one antibody member capable of binding to at 
least one determinant in the biologically active site as 
determined by the ability of said antibody member, either 
alone or in combination with at least one other ligand, to 
possess agonist or antagonist activity; 

b) identifying as potential ligands for 
activity at the target, those ligands which are capable of 
competing with the reporter antibody for binding to the 
target . 

56. The method according to claim 55 wherein 
multiple ligands are identified which when bound together 
covalently, are capable of binding to the determinants 
necessary to cause a bioligical response of the target, 
the method comprising: 
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a) providing reporter rVab antibodies for each 
of the determinants for which ligands are to be 
identified; 

b) for each of the rVab reporter antibodies, 
identifying as potential ligands for activity at each of 
the determinants of the target, those ligands which are 
capable of competing with each of the rVab reporter 
antibodies for binding to the target; 

c) covalently linking the identified ligands so 
as to form active multivalent ligands capable of 
activating the pharmacological target. 

57. The method according to claim 56 wherein 
the identified ligands are non-protein organic molecules. 

58. The method according to claim 56 wherein 
the two rVab reporter antibodies are used to identify two 
ligands which are combined to form the multivalent active 
ligand. 

59. The method according to claim 56 wherein 
the pharmacological target is a polypeptide receptor. 

60. A recombinant rVab antibody library 
comprising rVab members possessing at least one VL or VH 
region derived from a parental variable region with at 
least one CDR which is diversified to form different rVab 
members by deleting, inserting or substituting at least 
one amino acid within at least one CDR. 

61. The recombinant antibody library according 
to claim 60 wherein a parental VH region comprising at 
least one CDR is used to derive the VH region of the rVab 
members by deleting, inserting or substituting at least 
one amino acid within at least one CDR. 

62. The recombinant antibody library according 
to claim 60 wherein parental VL and VH regions comprising 
at least one CDR, are used to derive a pair of VL and VH 
regions of rVab members by deleting, inserting or 
substituting at least one amino acid within at least one 
CDR of each variable region. 
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63. The recombinant antibody library according 
to any one of claims 60 # 61, or 62 wherein the crystal 
structure of the parental V regions used to derive rVab 
members are known. 

64. The recombinant antibody library according to 
claim 60 wherein the crystal structure of the parental VH and 
VL pair used to derive the rVab members is known. 

65. The recombinant antibody library according to 
claim 60 wherein at least one of the parental V regions used 
to derive rVab is unmodified. 

66. The recombinant antibody library according to 
claim 60 wherein the CDR regions of a specific antibody are 
expressed on a plurality of frameworks which provides for 
variable geometric orientation of the CDR regions. 

67. The recombinant antibody library according to 
15 claim 60 wherein the rVab members further comprise a peptide 

sequence covalently bound to the rVab members to form rVab-Pep 
members . 

68. The recombinant antibody library according to 
claim 67 wherein the peptide component of the rVab-Pep members 

20 comprising VH and CL regions are expressed attached to either 
or both of the amino terminus of VH and the carboxy terminus 
of CL. 

69. The recombinant antibody library according 
claim 68 wherein the peptide component is attached to the 

25 amino terminus of the VH region. 

70. The recombinant antibody library according to 
claim 68 wherein the peptide component is attached to the 
carboxy terminus of the CL region. 

71. The recombinant antibody library according to 
30 claim 68 wherein two peptides are attached to the rVab 

component to form rVab-Pep 2 . 

72 . The recombinant antibody library according to 
claim 68 wherein the peptide comprises between about 5 and 50 
amino acids . 
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73. The recombinant antibody library according to 



claim 72 wherein the peptide comprises between about 7 and 25 
amino acids. 

74 . The recombinant antibody library according to 
claim 73 wherein the peptide comprises about 8 amino acids. 

75. A method of providing a model for a ligand 
capable of binding to a determinant of an active site of a 
pharmacological target, the method comprising: 

a) providing at least two rVab identified as binding 
to an active surface of a pharmacological target; 

b) identifying the regions of the rVabs that bind 
the biologically active site or individual inactive surface 
determinants of the bioactive structure; 

c) grouping the rVabs by overlapping structures 
which bind to common epitopes; 

d) determining the relative spatial orientation, 
charge and energetics of the identified binding sites 

e) determining the molecular structure necessary to 
bind the target and confer activity. 
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rCL 



_ ?1 lf p J [XHLWC] 
[rs3o) 



[rs4] 



pVHACCCPTOR _ JvHl-15 L_F^ 
<pVH-CHl> |> \ » 



rCHl 



JrsfO! 



— .Sfil.Hcol, — ;Xhol; — ; 



[rs4'] 



pVHACCEPTOR 



---;Notl; X 



r.b.s~ pelB leader 
_Z 



pEXPRESSORrVab 



Apal 
r.b.s- pelB leader . 



piED 1 ■ CH| [ 



EKLNK 1S0TAG 

^7T 



[ 



rCL 



Notl < 2 Terminators) 



EKLNK ISOTAG 



pEXFRESSORrVab-I 



pEXPRESSORrVab.il 



(Sfil] >[Ncol) 



[AscI] (2 Terminators) 



Fic.9. 
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rVLlib PRIMER CONSTRUCTION 





L1.1FW0: 19-24ASSSVCNKW)2M3435-rsB-38-rs4 


63mep 


ACDR 


I. 


L1.2FWD: 19- 


■24ASQSICNNN)222L34-rsB-38-rs4 


66n>er 


ACDR 




L1.3FWD: 19-24SSESL(NNN)232L34-rsB-3S-rs4 


6€ner 


aCDR 


4, 


L1.4FWD: 19- 


•24SQSSLCNNN)232L34-rsB-3a-rs4 


66ffter 


aCDR 


S. 


Ll.SFWD: 19- 


•24GSESD(NNN)2SNGNT32L34-rsA-36-rs< 


wiicr 




6. 


L1.1-3BCK: 


prs2-prs2\2CI)3-9 


3&HCP 


APPEND 


7. 


L1.4BOC: 


. .prs2-Drs2\ 2003-9 


36mer 


APPEND 


8. 


L1.5BCK: 


...prs2-DPs2\2(S)3-9 


■*» r 

3tmer 


APPEND 


9. 


L1ALLFWD: 


34-rsB-44-prs5 


54mer 


APPEND 


10. 


L1ALLBCK: 




•lamer 


AUDI TCV 


11. 


L271YFWK: 


63-71CY>rsC-76-pps5(10) 


49mer 


APPEND 


12. 


L271YFWK: 


63-71CF>psC-76-pps5C10> 


49mcp 


APPEND 


13. 


L271YFWK: 


63-71(A)-rsC-76-pPs5(10) 


49mer 


APPEND 


14. 


L2ALLBCK: 


38-rsB-48CI>49CNNN)252-58 


CBrjcp 


ACDR 


15. 


L1-8ALLBCK: 




49mer 


AMPLIFY 


16. 


L3.1FWD: 


84-89Q 91 C NNN) 294P9697 - 100 


54mer 


ACDR 


17. 


L3.2FWD: 


8A-89Q 91CNNN)2P9S9697-100 


54mer 


aCDR 


18. 


L3.3FWD: 


84-89Q 91CNNN)294P97-100 


54<nep 


ACDR 


19. 


L3.4FW0: 


84-899091(NNN) 294959697- 100 


54mer 


aCDR 


20. 


L3.5FWD: 


84-89909l9293CNNN)294959697- 100 


54mcp 


ACDR 


21. 


L3.6FWDV 


84-89909192(NNN)294959697-100 


54mer 


ACDR 


21. 


L3ALLBCK: 


prs6-72-rsC-76-82 


48mep 






LJCLLNKFWD: 


95-100-rsC-110-Ps4 


Slmep 


APPEND 


23. 


CLFWO: 


209-psA* -216Crs4)-pps5 


36mer 


APPEND 


24. 


CLBCK: 


pp s0- 105- 107CCLLNK) - 1 10- 1 16 


45mer 


APPEND 


rVHlib == 


IWER CONSTRUCTION 






25 


5*VHF«0: 


40-51-rs3-pUC 


54mep 


APPEND 


26 




ppsl-lCprs2>7 


30mer 


AMPLIFY 


27 


H1.1BCK 


17-psB-23A*2SG- F«28F* 30CNNN)3233M* 35-40 


63mep 


ACDR 


28 


H1.2BCK 


17- psB-23A» 25G* S' 28F • 30CNNTO 323 3W 35-40 


63mer 


aCDR 


29 


H1.3BCK 


17-PsB-232425G # Y»2SF*30fNWO3l23233W 35-40 66mer 


\CDR 


30 


HIALLFWrD 


pCFWD- pCLONALLFWD (see 




AMPLIFY 


31 


H31FVKD: 


100-104-rs3°-rs3(CHltNK)-rs3* -prs4 


39mer 


APPEND 


32. 


H31BCK : 


PC-17-PSB-24 


30mer 


APPEND 


33 


H2.1FWD 


474849505152(NNN)54G # 56-rsD-S9. . . 


45mer 


ACDR 


34 


H2.2FWD 


474849505152P»(NWO54G*56-rsD-59. . . 


48mer 


ACDR 


35 


H2.3FWO' 


474849505152P»(NNN)G*F 56-rsD-59. . . 


48mer 


ACDR 


36 


H2ALLBCK 


!5-24pC 


36mer 


AMPIIFV 


37 


3'VHR©: 


89-95-ps5-pCFWD 


30mer 


AMPLIFY 


38 


3'V>«CK: 


Pps2-56-psD* -59-65 


39mcr 


AMPLIF> 


39 


H3.5FWD: 


89-95<NNN)3DY-Ps3°-104 


39mer 


ACDR 




H3.7FVTO: 


89-95G(NNN}Y(NNN)0(NNN)0G- rs3°-104 


45mer 


ACDR 


41 


H3.10FVYD: 


89-95Y(NNN)S(NNN)P(NNN )YFDY - r s 5°- 104 


54mer 


ACDR 


SEQUENCI 


\G PRIMERS 






SEQ. 


42. 


pCFWD 


DUCFWD = oCLONALlFWD 




43. 


pCBCK 


pUCBCK = pCLONALLBCK 




SEQ. 
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A, amplified and digested fragments < rVLCUib > from pVLACCEPTOR.lib 
ApaLI 



loxF 



fde bas«J > 




ApaLI 



Ascl HtndUI sfil 



region to be % 

LoxREC 



B« amplifed and digested fragments trVHCHUib) from pVHACCEPTOR.Iib 




LoxPRO 



Individual VHCH1 and VLCL within a bacterium are recombtned in vivo (X) by Cre recombinase 



ApaLI rVLClJib m a^i 



* region to be * 
/ exchanged \ 



lexP 



recombination 
in vivo 




loxPSll 



rVHCHl.lib 
^ ? 



Hindtll 




Shi Ncol 



Notl 



LoxREC 



LoxPRO 



^rVi n M h m .rVHCHl.lib 

-loxP floxP 
| | 511 

ApaU J^ indm sT C ° l NoO 



in 



id <p rVab CARRIER 



FIG. 11 
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A. amplified and digested fragments < rVHCHUib t from pVLACCEPTORJib 




Tfd^RECEIVER 



/ 

Ascl HifMlDJ 



B. amplifed and digested fragments < rVT n.iib > from pVH ACCEPTOR. lib 



UC19 PROVIDER 




ApaLl 



Ascl 



C* Individual VHCH1 and VLCL within a bacterium are recombined in vivo (X) by Cre necombinase 

rVLCLJib 

^rVHCHUib 

ILnkr 




D. 



Universal IdBCK 
Primer sequence 



LpeJB 

Sfil 



fd+RECEIVER 

x rVHCHlJib 
\ member / 
» lEK 



Ascl Hindlll 



x rVLCLJib • 
\ member / 



ApaLl Ascl 
pUC19 PROVIDER 



loxF 



iv *XhoJ I - 




fda rVabPEP CARRIER 



Noll 



2su + codons A P* L1 



E. 



ICCCGSlUnker 



I PEP8UB) VH ™ 
\ rsPELBI 

riiner VHUCRPETBCK 



Ascl Hindlll 
ICL208-21&I 



Universal fdFVVD 
Pnmer se qu ence. 



|CCGCS|Linker 




CLLKRPtrrWD 



i. Making rVab-PEP 1 Lib 

with Peptt attached to Ammo Term nia of VH use FWD pnmcr Universal fdFWD and VHLXRPEPBCK pnmer 

ii. Making rVab-PEP 1 Lib 

with Pep* attached to Carbovy Terminus of CL use FWD primer CLLKRPEPFWD and Universal fdBCK pnmer 
tii. Making rVab-PPEP* Lib 

with a Peptf attached to Amino Terminus of VH and to the Carbocv Terminus of CL use FWD primer CLLKRPEPFWD and 
BOC pnmcr VHLKRPEPBCK 



FIG. 13. 



WO 96/04557 PCI7US95/101M 



14/25 




j.1=N 



(VH„CH1 H : VL m CL) 



LEGEND: 




rVH n CH1 .Lib member 




fd plasm id carrying rVH n CH1 Lib member 




rVL^CL. Lib member 




mature q> carrying rVL^CL phagmid 




gplll § coat protein 




VHpCHI-gpM fusion protein 



Fig. 14. 
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Parental Fab 



rVab 



1 



rVab.Lib 



Initial Active. 



(rVabT+S+A+.LIb) 



CONSTRUCTION 

CSR&CDRl 0 
Diversification 

SELECTION 
of 

1° Attributes 
Library 



with Active Lib of 
Large Size 




with Active Ub of 
Limited Size 



1° Simplification 
of 

Actlvclib 

GROUPING by: 
PGR Characterization 
Attribute Characterization 
Quantity and Quality 



>nd Diversification 
of 

Active. Ub 



REMOVAL OI CDR/CSR 
VI. alone 
VH atone 

individual C3R/CDRH3 



SELECT BEST CI I ANG ES 



ADDITIONAL MUTACENSIS 
Saturation (all CDRs/CSRs) 
Selective 
(one CDR/CSR) 
(one position within a CDR/CSR) 



COMBINATION OF BESTC! I ANG ES 



FIG. 15. 
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A. Isolation for Target Recognition (T+) 

free rVabT"(j) 



*<{) Lib. / / isolated 



removes Wfree rVabT'tf) 



B. Isolation for Target Specificity and/or Selectivity (S + ) 

rVabS'Cfr 
free com-T-pep complexes 

^ rVabS^ Ub. — /'' ** ( m^fS s isolated 

com-T-pep— -f^VabS** rV«H> 1 



removes free 



\1Y rVabS*^ 
ey com-T-pep 



pep complexes 



LEGEND: rVab<J> - phage displayed rVab; Lib » library; 

&X»§ = Target (T) bound to matrix (here plastic dishes) 
com-T-pep » none-Target entity (here peptide) with 
undesrred common surface epitopes 



FIG. 16. 



# • 



WO 96/04557 PCT/US95/101S2 



18/25 




WO 96/04557 



PCTAJS95/10182 



19/25 



©WGA:RG + rVabT 7 S 7 A 7 .Lib 
|LowSoMdSpin(LSS) 

T~ — * 



rV«bT"S ? A ? 



©WGA:RG:rVabT*S ? A ? -^-J 
R«"P«j|+ GTP 

* — - — T 



amplify 
and recycle 



©WGA:RG»VabT + S 7 A 7 rVabT*S 7 A+ I _ 

sepnsdax gso: ~l 



amplify 
and recycle 



perenzeptne/antagontsts 



©WGA;RG 



©WGArRG 



"{ ^ 

©WGA:RG:rVabT + S*A+ 



tractive Surface 
Re 



com ituQMOon or 
♦ GTPbm I 



§>*GA:RG:rVabT 7 S 7 A + 



amplify 
land recycle 

I 



free rVabT + A+S 7 
Competitive- Antagonist-Like 



rVabT*S?A*-±_ _ _ 

SeortadaxG S0 I 

or CantnfuQSDon ■ 



Allosteric-Aoonist-Like T 
©rVGA:RG 'VfcibT+S r Ap /r + ▼ 
^ K/b ©WGA:R:rV 



amplify $ 
and recycle 



;rVabT*S 7 A* free rVabT+S 7 A*-J 



LSS 



Partial-Agonist-Like 

© 

«. CD'-"'" 

Q^GA:GR m1 ^solubl e GR^^s 



Agonist-Like 



rV a bT-S mT +A^| 



€V(GA:R:rVabT*S*A* 



amplify (J 

and recycle I 

^Su mniamn | 

G«m2.3.4.5 : ' v » bT+ S m ,-A- , 



FIG. 19 
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Isolate Tl+rVab 



® - T l +|rVabT ? S ? A ? $ L0>. + comp-T-receptor x 
"jwashes (W) 



discard supernatant 



atrix 



Domain T 1rif0 _ Grouping: © -t, - n , plus domain peptide n,o.. 



Matrix- 



Display Pep8 Ub. 
Pep8T 2 ? $.Ub - 

•-►PepBT^+i 



incubate and spin 
■ supernatant 



p8T 2 + 4 Ub 



Combinatorial Bivalent 
T1-2 Recognition Lib. 





rVabT ln +S*A ? 






r 






rVabT 1n + S+A ? — Pep8 T2 + <J Ub. 



PAN for rVab-Pep which 

form ACTIVE Ti-T 2 DIMER 

' u add 

free Tj & free T 2 



Matrix 



add 

(m^nonmterfenng AbT-j and test 
for fluoresence transfer between 
Tl & T2 



discard supernatant 



rVabT ln +S+A*-Pep8 T 2+ ! 



FIG. 20- 
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rVabT 1n +S + A+-Pep8 T2m + [A+] 


Obtained as rVabT2m+ -Pep8T1 +[A+] 

which competes with Pep8T2m+ } 


t 


i ^ 


rVabT 2m +S + A* rVabT^ n + S + A* 





Label [*] j 



^*]rVabT 2 m + S+A* 


[*JrVabT ln S + A* 


Search CHEMFILE 
for Competitive 
SOMERs , 


Use as 
Reporters 

r i 




SOMER-T 2m 


SOMER-T 1n 



Link SOMERs covalently 
Form multi-Target surface 
interactors: T1 n-T2m 



mn 




FIG. 21 



WO 96/04557 



PCT/US95/10182 



22/25 




WO 96/04557 



PCTAJS95/10182 



23/25 



i SEQUENCES of rT+S + A+ Vab act } 



MODELLING 



By PCR for the set of Active 
Surface Scanners (rVabTSA*) 
of a defined drug target domain 



Known loop structures 
Known framework structure 



GENETIC ALGORITHM 



a£,y: - chemical attributes 
-.charge distribution 
j hydrophobicity 
I solvent accessabttity 
Lpolar network 

x.y»z. - atomic positions 



P ( a, p, Y ) = S (x, y, z) 




P - mapping of attributes 
S - every point on surface 



DIRECTED 
SYNTHETIC EFFORT 



COMPUTATIONAL 
SCREENING 




DISCOVERY LEAD 



Using known methods 
(i.e.. DOCK. HOOK 
programs) 



FIG. 23. 
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1 . For a set of TSA + rVabs for a specific Target: 

Determine the General Orientational Matrix for each attribute Rj 

R = Generalized orientational matrix [ <J>, V, a> ] 

mapping simitar attributes 
a, p t y = Chemical and structural attributes 

[x,y,z, a, y,..]j ► [ X +, y + z + ^ p f T fM]j 

x+, y+, z+ = Rj 4>, u>) ( X> y, z) 



2. Find the set of Rj's that minimizes some target function of <x p, y 
without atomic clashes 

Example: overlap arginine 
guantdinium ( • ) groups 



COR 




VheaVY CSRS+CDRH3 - €IB V UGHT CSRS = C 



Fig. 24. 
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1. After obtaining the first {attribute lj; i.e., { R }j, 

Repeat process for hydrophobicity ; i.e., { H }j 

Search for the overlap of the 

{H}j of methyl groups with the {RJj of arginines 

Now use {R{j (£) {H}j as good predictor of other 
overlaps for the other sets of chemical attributes 



2. 
3. 



Iterate process; eliminate 'outliers' and derive a single, 
overlapping neighborhood Active Surface Scanner surface 

S« {R® H(X)™.Z } j=lfN 
this is the BEEP 9 

i.e., the Biological Enhanced Ensemble Pharmacophore 
Model of a 2D-BEEP 



pocket 



polar ridge 




hydrophobicity 



arginme 



Fig. 25. 
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